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(57) Abstract: Methods and compositions are described for analyzing complex protein mixtures, such as proteomes, using actrv- 
ity-based probes. In particular, probes that specifically react with and bind to the active form of one or more target proteins are 
employed. Labeled peptides obtained from the labeled active target proteins can be used in screening and identification procedures, 
and can be related to the identity, presence, amount, or activity of active members of the desired target protein class. The methods 
£^ and compositions described herein can be used, for example, to provide diagnostic information concerning pathogenic states, in 
identifying proteins that may act as therapeutic targets, and in drug discovery. 
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PROTEIN PROFILING PLATFORM 

[0001] This application claims priority to provisional U.S. Patent Application 

No. 60/273,007, filed March 2, 2001, which is hereby incorporated by reference in its 
entirety, including all tables, figures, and claims. 

5 FIELD OF THE INVENTION 

[0002] The field of this invention is analysis of complex protein mixtures, such 

as proteomes. 



BACKGROUND 

[0003] The following discussion of the background of the invention is merely 

10 provided to aid the reader in understanding the invention and is not admitted to 

describe or constitute prior art to the present invention. 

[0004] The biological revolution has progressed from genomics to proteomics 

as the newest frontier for gathering information concerning cellular physiology. 
Transcription of DNA is an early step in an extended process resulting ultimately in 

15 the expression of genomic information as a functional protein. Additional steps can 

include processing of the initial transcript to mRNA, translation of the mRNA into 
protein, and posttranslational processing of the protein (e.g., cleavage of the protein 
into smaller fragments, modification of the protein by glycosylation, methylation, 
acylation, phosphorylation, etc.). In addition, the protein may be activated or 

20 deactivated by interaction with other proteins and/or with small molecules (e.g., 

cofactors). Regulation of active protein expression can occur at one or more of these 
steps, and the amount of active protein in the cell at any time will vary widely with the 
state of the protein in the cell. Thus, the presence of a given gene in a cell's genome, 
or the total amount of a particular protein in a cell is not necessarily a good 

25 prognosticator of the state of the cell, as reflected by the amount of active protein 

present in the cell. 

[0005] In evaluating candidate drugs, the readout should provide an indication 

how the drug will perform in vivo. For example, an accurate evaluation of a candidate 
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drug can be obtained by using the drug in vivo and determining the effect of the drug 
on the indication and/or absorption, distribution, metabolism, and excretion 
("ADME") studies performed. However, where there are a large number of candidates 
as are available today from combinatorial libraries, and natural and other sources, 
5 substitute procedures must be available to allow for reducing the number of candidate 

drugs to be studied. To allow for large numbers of candidate drugs to be evaluated, 
while having reasonable costs and times involved, cellular surrogates are finding use. 
One can expose cells in culture to candidate drugs. The question then is what should 
be analyzed to obtain the greatest amount of accurate information relevant to the effect 
10 of the drug in the most expedient way. 

[0006] There is substantial interest in providing platforms that will provide 

answers to questions asked about the effect of candidate drugs on cells, tissues, and/or 
organisms. In order for platforms to be useful they should be efficient, reliable, and 
economic, and maximize the information provided and the predictive capability of the 
15 results. The ability to analyze samples in parallel and the reproducibility, speed, 

automation, sensitivity, and specificity of the analysis procedures can all contribute to 
maximizing the efficiency and reliability of such a platform. 

[0007] Numerous methods have been described for analyzing protein 

compositions. Some typical examples include WO 00/1 1208, which discusses mass 

20 spectrometric methods for analysis of proteins; Cravatt and Sorenson, Current Opinion 

in Chemical Biology (2000) 4(6): 663-668, which discusses chemical strategies for 
analyzing protein function; U.S. Patent No. 4,433,051, which discusses the use of a- 
difluoromethylornithine for use in protein analysis; U.S. Patent No. 6,127,134, which 
discusses difference gel electrophoresis using matched multiple dyes; Gygi et al., Proc. 

25 Natl. Acad. Sci. USA (2000) 97:9390-5, which discusses the use of 2D gel 

electrophoresis in conjunction with mass spectrometry to analyze yeast proteins; and 
Aebersold et al., PCT/US99/19415, which discusses digestion of labeled protein 
samples. 

[0008] ; Complex protein mixtures, such as proteomes, can be difficult to 
30 analyze. Not only are there many components in the mixtures, but as samples of these 

mixtures may be processed, many artifacts can be introduced into the sample, e.g., by 
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hydrolysis of amide bonds, deamination, oxidation and the like. In addition, proteins 
present in the mixture that originally derived from the same polypeptide sequence may 
have been subject to differential processing reactions, such as glycosylations, 
prenylations, etc. Moreover, in analysis procedures in which the proteins in a complex 
5 mixture are subject to proteolysis, the total number of components is greatly increased 

in comparison to the original sample. As a result, numerous fractions in a 
chromatography procedure or bands in an electrophoretic gel can be related to a single 
protein in the original sample, greatly complicating the interpretation of the data. It is 
of interest to find ways to simplify the compositions that are being analyzed to permit 
10 a more accurate and robust interpretation of the observed results. 

SUMMARY OF THE INVENTION 

[0009] Compositions and methods are provided for the identification of active 

proteins in a complex protein mixture (e.g., a proteomic mixture). In various aspects, 

15 the present invention relates to methods for combining one or more complex protein 

mixture(s) with activity based probes ("ABPs") to produce covalent conjugates of the 
active target proteins with the probes. The probes comprise a "warhead" (defined 
hereinafter) directed to a desired protein class covalently linked to a ligand, which may 
be, directly or indirectly, detectable, e.g. by fluorescence ("fABP"), and which may be 

20 used for separation and/or detection. Optionally, the ligand may be used for 

enrichment of the conjugates. 

[0010] Following reaction of the complex protein mixture with one or more 

ABPs, the resulting protein conjugates are proteolytically digested to provide probe- 
labeled peptides. In preferred embodiments, ABPs are selected such that each active 
25 target protein forms a conjugate with a single ABP, most preferably at a single discrete 

location in the target protein; thus, each conjugate gives rise to a single ABP-labeled 
peptide. Enrichment separation, or identification of one or more ABP-labeled peptides 
may be achieved using liquid chromatography and/or electrophoresis. Additionally, 
mass spectrometry may be employed to identify one or more ABP-labeled peptides by 

* 

30 molecular weight and/or amino acid sequence. In particularly preferred embodiments, 

the sequence information derived from of the ABP-labeled peptide(s) is used to 
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identify the protein from which the peptide originally derived. Variations of these 
aspects can involve the comparison of two or more proteomes, e.g. y with ABPs having 
different ligands, or, when analysis comprises mass spectrometry, having different 
isotopic compositions. 

5 [001 1] Thus, in a first embodiment, the invention relates to methods for 

determining the presence amount, or activity of one or more active target proteins in a 
complex protein mixture, most preferably a proteome. In various embodiments, these 
methods comprise one or more of the following steps: contacting a complex protein 
mixture with an ABP that specifically binds to one or more active target proteins 

10 present in the complex protein mixture; proteolyzing the active target protein(s) to 

produce a product mixture comprising ABP-labeled peptides; separating the resulting 
product mixture into one or more of its component parts, where one or more of these 
components comprise one or more ABP-labeled peptides; and generating a signal from 
those peptides bound to the ABP. Such a signal can be correlated to the presence, 

15 amount, or activity of one or more active target proteins present in the original 

complex protein mixture. 

[0012] In preferred embodiments, ABP(s) and reaction conditions are selected 

such that the relative ability of an active target protein to become labeled depends on 
the relative level of activity of that active target protein; the signal obtained from such 

20 labeling can be correlated to the activity of the active target proteins in the proteomic 

mixture. Alternatively, ABPs can be used under conditions in which all active forms of 
an active target protein are labeled, regardless of the level of activity of the particular 
active target protein. For example, the time of reaction may be extended so that the 
labeling reaction goes substantially to completion; the signal obtained from such 

25 labeling will be unrelated to the activity of the active target proteins in the proteomic 

mixture. 

[0013] Similarly, in a second embodiment, the instant invention relates to 

* 

methods for comparing the presence or amount of one or more active target proteins in 
two or more complex protein mixtures. In various embodiments, these methods 

30 comprise one or more of the following steps: contacting one or more complex protein 

* 

mixture(s) with one or more ABPs, where the ABP(s) specifically bind to one or more 
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active target proteins present in each complex protein mixture; combining the complex 

protein mixtures following this contacting step to form a combined complex protein 

t. 

mixture; proteolyzing the active target protein(s) in the complex protein mixture(s) to 
produce one or more product mixtures comprising ABP-labeled peptides; separating 
5 the resulting product mixture(s) into one or more component parts, where one or more 

of these components comprise one or more ABP-labeled peptides; and comparing the 
signals generated from corresponding ABP-labeled peptides originating from each 
original complex protein mixture. 

[0014] In preferred embodiments, the ABP(s) comprise a ligand that can be 

10 used to identify those ABP-labeled peptides originating from a particular original 

complex protein mixture (e.g. two ABPs comprising the same warhead but different 
fluorescent moieties can be used to label two different complex protein mixtures). 

[001 5] In additional preferred embodiments, the methods further comprise one 

or more of the following: the separating step comprises sequestering one or more 

15 ABP-peptide conjugates using a receptor that specifically binds to said probe; the 

ABP(s) used comprise a fluorescent moiety, and the receptor is an antibody or 
antibody fragment that binds to the fluorescent moiety; the ABP(s) used comprise a 
fluorescent moiety, and the signal generated is a fluorescent signal; the signal 
generated is a mass spectrum; the ABP(s) used comprise an isotopic label; the 

20 separating step(s) employed comprise one or more separation methods selected from 

the group consisting of affinity separation, gel electrophoresis, capillary 
electrophoresis, liquid chromatography, HPLC, electrospray ionization and MALDI; 
one or more active target proteins bound to said probe are bound to a solid support 
prior to proteolysis; one or more standard proteins are added to the complex protein 

25 mixture prior to proteolysis; the standard protein(s) are labeled with an activity based 

probe prior to addition to the complex protein mixture; the standard protein(s) are 
labeled with an ABP comprising a fluorescent moiety that is distinguishable from the 
ABP(s) contacted with complex protein mixture. 

[001 6] Additionally, in particularly preferred embodiments, the methods 

30 described herein include a step in which, prior to proteolysis, one or more components 

of the complex protein mixture not bound to an ABP are removed. This can be 
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accomplished, for example, by sequestering ABP-active target protein conjugates (e.g. y 
by binding conjugates to a solid phase, and washing out the remaining, unbound, 
components of the complex protein mixture). The conjugates can then be proteolyzed 
either while sequestered (e.g., bound to a solid phase), or following release into 
solution. 

[001 7] In another aspect, the present invention relates to methods and 

compositions for correlating a separation profile to a peptide having a known sequence 
using ABPs. In these methods, a separation profile is generated for the ABP-bound 
peptide by performing one or more separation methods, and generating a signal from 
the ABP-bound peptide. A separation profile that is characteristic of the peptide having 
said known sequence can be used to identify the peptide in subsequent separation 
procedures. 

[001 8] In various preferred embodiments, these methods comprise one or more 

of the following steps: contacting a protein sample comprising the peptide with an 
ABP that specifically binds to said amino acid sequence; proteolyzing the protein 
sample to produce a product mixture; separating peptides bound to the ABP from said 
product mixture; generating a mass spectrum from the peptides bound to the ABP to 
provide sequence information for the peptide; and generating a separation profile by 
capillary electrophoresis of peptides bound to said probe, where the separation profile 
is one or more migration time(s) of peptides bound to the ABP. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0019] Fig. 1 is a schematic drawing of a typical capillary electrophoresis - 

laser induced fluorescence instrument configuration to provide dual channel 
fluorescence detection. In the figure, "PMT" refers to the photomultiplier tube of a 
detector; "BP" refers to a band pass filter; "SP" refers to a short pass filter (a filter that 
passes shorter wavelengths), and "notch" refers to a notch filter. 

[0020] Fig. 2 is a schematic drawing of a typical separation instrument for use 

in screening and/or identification methods of the present invention. 
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[0021] Fig. 3 is a capillary electrophoresis profile for peptides generated from 

eight serine hydrolases digested in a complex protein mixture (liver cytosol). 

DESCRIPTION OF THE SPECIFIC EMBODIMENTS 

5 [0022] The subject methods and compositions provide enhanced simplicity and 

accuracy in identifying changes in active protein composition of a complex protein 
mixture. Using ABPs that bind to active target proteins, the analysis of complex 
protein mixtures may be greatly simplified, particularly by providing ABPs that bind 
to active target proteins at a single site. The proteins are then proteolytically digested, 
10 resulting in a single representative ABP-labeled peptide fragment from each of the 

conjugates. Using various approaches to identification of the ABP-labeled peptide(s), 
the protein(s) from which each ABP-labeled peptide originally derived can be 
identified. 

[0023] The protein profiling platforms described herein can have a number of 

15 steps leading to the identification of active target proteins in a complex protein 

mixture. A complex protein mixture, and preferably two or more complex protein 
mixtures, e.g., a sample and a control, can be used as obtained from a natural source or 
as processed, e.g., to remove interfering components and/or enrich the target protein 
components. Each proteomic mixture to be analyzed is combined under reaction 
20 conditions with at least one ABP to produce conjugates with active target proteins. The 

ABPs used in two or more complex protein mixtures can differ as to the choice of 
ligand moiety and/or isotopic composition in order for the labeled complex protein 
mixtures to be directly compared {e.g., in the same capillary of a capillary 
electrophoresis apparatus or lane in an electrophoresis gel, or in a mass spectrometer). 
25 Additionally the ligand moiety can be used to sequester the conjugate(s) and remove 

unlabeled components from the complex protein mixture(s) or proteolytic product 
mixture(s). 

[0024] As described herein, the labeled ABP-active target protein conjugates 

are most preferably proteolytically digested prior to the next stage of enrichment 
30 and/or analysis. While the skilled artisan would expect that proteolytic digestion would 

lead to a more complex protein mixture, the methods of the instant invention 

7 
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demonstrates that such proteolysis can actually provide an advantageous simplification 
of the complex protein mixture during subsequent analysis. 

[0025] The analysis platforms described herein differ as to the methods of 

enrichment and analysis using liquid chromatography and/or electrophoresis, and/or 
5 mass spectrometry for identification and quantitation. The choice of the platform is 

affected by the size of the sample, the rate of throughput of the samples, the mode of 
identification, and the need for and level of quantitation. 

[0026] The compositions and methods described herein find use for the most 

part with biological samples, which may have been subject to processing before 

10 reaction with the ABPs. "Biological sample" intends a sample obtained from a cell, 

tissue, or organism. Examples of biological samples include proteins obtained from 
cells (e.g., mammalian cells, bacterial cells, cultured cells), particularly as a lysate, a 
biological fluid, such as blood, plasma, serum, urine, bile, saliva, tears, cerebrospinal 
fluid, aqueous or vitreous humor, or any bodily secretion), a transudate or exudate {e.g. 

1 5 fluid obtained from an abscess or other site of infection or inflammation), a fluid 

obtained from a joint (e.g. a normal joint or a joint affected by disease such as 
rheumatoid arthritis, osteoarthritis, gout or septic arthritis), or the like. 

[0027] Biological samples may be obtained from any organ or tissue (including 

a biopsy or autopsy specimen) or may comprise cells (including primary cells, 
20 passaged or cultured primary cells, cell lines, cells conditioned by a specific medium) 

or medium conditioned by cells. In preferred embodiments, a biological sample is free 
of intact cells. If desired, the biological sample may be subjected to prior processing, 

v. 

such as lysis, extraction, subcellular fractionation, and the like. See, Deutscher (ed.), 
1990, Methods in Enzymology, vol. 182, pp. 147-238. 

25 [0028] Of particular interest are samples that are "complex protein mixtures." 

As used herein, this phrase refers to protein mixtures having at least about 20, more 
usually at least about 50, even 100 or more different proteins, where the particular 
distribution of proteins is of interest. An example of such a complex protein mixture is 
a proteome, as defined hereinafter. Complex protein mixtures may be obtained from 

30 cells that are normal or abnormal in some particular, where the abnormality is 

8 
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informative as to treatment, status, disease, or the like, can be analyzed using the 
methods of the subject invention. 

[0029] The term "proteome" as used herein refers to a complex protein mixture 

obtained from a biological sample. Preferred proteomes comprise at least about 5% of 
5 the total repertoire of proteins present in a biological sample (e.g., the cells, tissue, 

organ, or organism from which a lysate is obtained; the serum or plasma, etc.), 
preferably at least about 10%, more preferably at least about 25%, even more 
preferably about 75%, and generally 90% or more, up to and including the entire 
repertoire of proteins obtainable from the biological sample. Thus the proteome may 
10 be obtained from an intact cell, a lysate, a microsomal fraction, an organelle, a 

partially extracted lysate, biological fluid, and the like. The proteome will be a mixture 
of proteins, generally having at least about 20 different proteins, usually at least about 
50 different proteins and in most cases 100 different proteins or more. 

[0030] Generally, the sample will have at least about 1 x 10' 1 1 g of protein, and 

1 5 may have 1 g of protein or more, preferably at a concentration in the range of about 0. 1 

- 50 mg/ml. For screening applications, the sample will typically be between about 1 x 
10" 11 g and about 1 x 10 3 g of protein, preferably between about 1 x W 6 g and 1x10' 
4 g of protein. For identification of labeled active target proteins, the sample will 
typically be between about 1 x 10" 9 g and about 1 g of protein, preferably between 
20 about 1 x 10 -4 g and 1 x 10" 1 g of protein. The term "about*' in this context refers to +/- 

1 0% of the amount listed. 

[003 1] The sample may be adjusted to the appropriate buffer concentration and 

pH, if desired. One or more ABPs may then be added, each at a concentration in the 
range of about InM to 20mM, preferably 10 nM to ImM, most preferably 10 nm to 
25 1 00 pM. After incubating the reaction mixture, generally for a time for the reaction to 

go Substantially to completion, generally for about 0.1 1 - 60 minutes, at a temperature 
in the range of about 5 - 40°C, preferably about 10°C to about 30°C , most preferably 
about 20°C , the reaction may be quenched. 

[0032] In one aspect of the invention, the method provides for quantitative 

30 measurement of active target proteins in biological fluids, cells or tissues. Moreover, 

9 
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the same general strategy can be broadened to achieve the proteome-wide, qualitative 
and quantitative analysis of the state of activity of proteins, by employing ABPs with 
differing specificity for reaction with proteins. The methods and compositions of this 
invention can be used to identify proteins of low abundance that are active in complex 
5 protein mixtures and can be used to selectively analyze specific groups or classes of 

proteins, such as membrane or cell surface proteins, or proteins contained within 
organelles, sub-cellular fractions, or biochemical fractions such as immunoprecipitates. 
Further, these methods can be applied to analyze differences in expressed proteins in 
different cell states. For example, the methods and reagents herein can be employed in 
10 diagnostic assays for the detection of the presence or the absence of one or more active 

proteins indicative of a disease state, such as cancer. 

* 

[0033] The subject method can be used for a variety of purposes. The method 

can be used in the diagnosis of disease, the response of cells to an external agent, e.g. a 
drug, staging diseases, such as neoplasia, identifying cell differentiation and 
1 5 maturation, identifying new proteins, screening for active drugs, determining side 

effects of drugs, identifying allelic response, identifying useful probes from 
combinatorial libraries, etc. 

[0034] The system uses ABPs specific for the active form of a protein or a 

defined group of proteins, usually directed to an active site on such proteins, and 

20 combines one or a mixture of probes, depending on the specificity of the probes and 

the variety in the group or groups of related proteins to be assayed. In the present 
invention, it is not necessary that there be no reaction of an ABP with inactive target 
protein(s). Rather, an ABP is defined as being "specific for," as "specifically reacting 
with," or as "specifically binding to," active target protein(s) if the ABP provides at 

25 least about twice the amount of signal from ABP labeling of active protein when 

compared to an. equivalent amount of inactive target protein. Preferably the signal 
obtained from active target protein(s) will be at least about five fold, preferably 10 
fold, more preferably 25 -fold, even more preferably 50-fold, and most preferably 100- 
fold or more, greater than that obtained from an equivalent amount of inactive target 

30 protein. 
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[0035] The term "target protein" as used herein refers to one or more proteins, 

an active site of which becomes labeled by one or more ABPs when the target protein 
is in its "active" form. The reaction mixture can provide conditions under which the 
ABP(s) react substantially preferentially with active target proteins. Particularly 
preferred target proteins are enzymes; other preferred target proteins include receptors, 
transcription factors, G-proteins, and the like. 

* 

[0036] The term "active target protein" refers to a target protein that is in its 

native conformation and is able to interact with an entity with which it normally 
interacts, e.g. enzyme with substrate and cofactor, receptor with ligand, etc., e.g. 
phosphorylated active form as compared to unphosphorylated inactive form and vice 
versa. In effect, the protein is in the form in which it can carry out its biological 
function. 

[0037] The term "inactivated" as used herein refers to a sample that has been 

treated so that at least a portion of target proteins that were active in the original 
sample are rendered inactive. An "inactive protein" can result from various 
mechanisms such as denaturation, inhibitor binding, either covalently or non- 
covalently, mutation, secondary processing, e.g. phosphorylation or 
dephosphorylation, etc. Functional states of proteins or enzymes as described herein 
may be distinct from the level of abundance of the same proteins or enzymes. 
Inactivated samples may be used to validate the activity-specific binding of ABPs as 
described herein. 

> 

[0038] The term "untreated" as used herein refers to a sample that has not been 

exposed to one or more conditions as compared to a second sample not exposed to 
such conditions. An untreated sample may be a sample that has not been inactivated; 
alternatively, an untreated sample may be one not exposed to one or more molecules 
(e.g., drug lead compounds) in a screening assay. Thus the compostions and methods 
described herein may comprise comparing a complex protein mixture obtained from 
cell(s), tissue(s), or organism(s) treated with one or more compounds (e.g., lead 
compounds in drug discovery) to a complex protein mixture obtained from cell(s), 

r. i 

tissue(s), or organism(s) not so treated. ABP-labeled peptides from the two samples 
may be compared for relative signal intensity. Such methods may indicate alterations 

11 
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in active protein content due to the treatment regimen. Additionally, such methods can 
also differentiate between treatments that act by direct inhibition of specific proteins 
("primary effects**) versus treatments that affect active protein content upstream, e.g., 
by altering expression of protein(s) ("secondary effects"). 

5 [0039] An "active site'* of a protein refers to an area on the surface of a protein, 

e.g., an enzyme molecule or surface membrane receptor, to which a binding molecule, 
e.g. substrate or reciprocal ligand, is bound and results in a change in the protein 
and/or ligand . For a receptor, the conformation may change, the protein may become 
susceptible to phosphorylation or dephosphorylation or other processing. For the most 

1 0 part, the active site will be the site(s) of an enzyme where the substrate and/or a 

cofactor bind, where the substrate and cofactor undergo a catalytic reaction; where two 
proteins form a complex, e.g. the site at which a G protein binds to a surface 
membrane receptor, two kringle structures bind, sites at which transcription factors 
bind to other proteins; or sites at which proteins bind to specific nucleic acid 

15 sequences, etc. 

[0040] In referring to affinity for an ABP to a target protein, one is concerned 

with the on-rate of the ABP with the target protein, since there is a negligible off-rate, 
where the ABP covalently bonds to the target protein. One can determine relative on- 
rates between ABPs by having less than a stoichiometric amount of the target protein 
20 as compared to the total amount of one or more ABPs and then measuring the relative 

amounts of the conjugates for each of the ABPs. In this way one can obtain a measure 
of the relative activity of each of the ABPs toward the active target protein, which for 
the purposes of this invention may be considered the affinity, if not the binding 
affinity, of the ABP for the target protein. 

25 [0041] Exemplary target proteins include enzymes, such as oxidoreductases, 

hydrolases, ligases, isomerases, transferases, and lyases (and including such enzymes 
or enzyme groups as serine hydrolases, metallo-hydrolases, dehydrogenases, e.g. 
alcohol and aldehyde dehydrogenases, and nucleotide triphosphate (NT)-dependent 
enzymes), although, the invention envisions ABPs which recognize any protein, e.g., 

30 enzyme, family. Other target proteins include proteins that bind to each other or to 

nucleic acids, such as transcription factors, kringle structure containing proteins, 

"12 
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nucleic acid binding proteins, G-protein binding receptors, cAMP binding proteins, 
etc. 

[00421 Structure of ABPs 

r00431 The ABPs of the present invention comprise a warhead, linked via a 

5 linker moiety to a ligand. Suitable ABPs for use in the invention are disclosed 

generally, for example, in the following: PCT Application No. PC17US99/19415, WO 
00/1 1208, entitled "Rapid Quantitative Analysis of Proteins or Protein Function in 
Complex Mixtures"; PCT Application No. PCT/US00/34187, WO 01/77684, entitled 
"Proteomic Analysis"; PCT Application No. PCT/US00/34167, WO 01/77668, entitled 
10 "Proteomic Analysis"; U.S. Provisional Application No. 60/266,687, entitled, 

"Activity Based Probe Analysis," filed February 5, 2001; PCT Application No. 

, entitled "Activity Based Probe Analysis" (Attorney Docket No. 063391- 

0202), filed February 5, 2002; each of which is hereby incorporated by reference in 
their entirety, including all tabes, figures, and claims. 

1 5 [0044] As will be described hereinafter, each of the warhead, the linker moiety 

("L"), and the ligand ("X") may be independently selected to provide different target 
specificities. Each of these components of an ABP is described in additional detail 
below. 

[0045] The term "warhead" as used herein refers to the portion of an ABP that 

20 is directed to and binds with an active site of an active target protein. The warhead 

comprises a functional group ("F") and an optional affinity moiety ("R"). Functional 

group (F) refers to one or more chemical groups within an ABP that specifically and 

covalently bond to the active site of a protein. The functional group may, by its very 

structure, be directed to the active site of a target protein. Alternatively, a separate 

25 affinity moiety (R) may be provided. Affinity moiety (R) refers to a chemical group, 

« 

which may be a single atom, that is conjugated to the functional group or associated 
with the linker moiety that provides enhanced binding affinity for protein targets 
and/or changes the binding profile of the warhead. The affinity moiety is preferably 
less than 1 kilodalton in mass. 
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[0046] The term "linker moiety* refers to a bond or chain of atoms used to link 

one moiety to another, serving as a covalent linkage between two or more moieties. 

[0047] The term "ligand" as used herein refers to a molecule that can be used 

to detect and/or capture the ABP in combination with any other moieties that are 
bound strongly to the ligand, so as to be retained in the process of the reaction of the 
functional group with the target active protein. The ligand may be added to the 
warhead-linker moiety combination after reaction with the target protein, to form the 
complete ABP. For this purpose, the warhead-linker moiety combination will include 
a chemically reactive functionality, normally not found in proteins, that will react with 
a reciprocal functionality on the ligand, e.g. vic.-diols with boronic acid, 
photoactivated groups, such as diazo bisulfites, etc. The warhead-linker moiety is then 
reacted with the ligand to complete the ABP. 

[0048] The ABP will have an affinity for an active site, which may be specific 

for a particular active site or generally shared by a plurality of related proteins. The 
affinity may be affected by the choice of the functional group, the linker moiety, the 
binding moiety, the ligand, or a combination thereof. As described hereinafter, one or 
more ABPs may be designed that exhibit specificity for a single target protein, or that 
exhibit specificity for a plurality of targets that may be structurally or functionally 
related. 

[0049] The ABPs of the subject invention may be illustrated by the following 

formula: 

R* (F - L) - X 

where * indicates that R may be optionally present, and L, if present, may be bound to 
either F, L or both F and L. 

[0050] Exemplary Fs as used in an ABP of the invention include an alkylating 

agent, acylating agent, ketone, aldehyde, sulphonate or a phosphorylating agent. 
Examples of particular Fs include, but are not limited to fluorophosphonyl, 
fluorppfiosphoryl, fluorosulfonyl, alpha-haloketones or aldehydes or their ketals or 
acetals, respectively, alpha-haloacyls, nitriles, sulfonated alkyl or aryl thiols, 
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iodoacetylamide group, maleimides, sulfonyl halides and esters, isocyanates, 
isothiocyanantes, tetrafluorophenyl esters, N-hydroxysuccinimidyl esters, acid halides, 
acid anhydrides, unsaturated carbonyls, alkynes, hydroxamates, alpha- 
halomethylhydrdxamates, aziridines, epoxides, or arsenates and their oxides. Sulfonyl 
groups may include sulfonates, sulfates, sulfinates, sulfamates, etc., in effect, any 
reactive functionality having a sulfur group bonded to two oxygen atoms. Epoxides 
may include aliphatic, aralkyl, cycloaliphatic and spiro epoxides, the latter exemplified 
by fumagillin, which is specific for metalloproteases. 

[0051] The linker moiety L, which potentially can be as short as a covalent 

bond, is preferred to be other than a bond. Since in many cases, the synthetic strategy 
will be able to include a functionalized site for linking, the functionality can be taken 
advantage of in choosing the linking moiety. The choice of linker moiety has been 
shown to alter the specificity of an ABP. See, e.g., Kidd et aL, Biochemistry (2001) 
40: 4005-15. For example, an alkylene linker moiety and a linker moiety comprising a 
repeating alkyleneoxy structure (polyethylene glycols, or'TEG"), have distinct 
specificities and provide distinct protein profiles. Thus, one of skill in the art can 
select the linker moiety of the ABP in order to provide additional specificity of the 
ABP for a particular protein or protein class. 

[0052] Linker moieties include among others, ethers, polyethers, diamines, 

20 ether diamines, polyether diamines, amides, polyamides, polythioethers, disulfides, 

silyl ethers, alkyl or alkenyl chains (straight chain or branched and portions of which 
may be cyclic) aryl, diaryl or alkyl-aryl groups, having from 0 to 3 sites of aliphatic 
unsaturation. While normally amino acids and oligopeptides are not preferred, when 
used they will normally employ amino acids of from 2-3 carbon atoms, i.e. glycine 
25 and alanine. Aryl groups in linker moieties can contain one or more heteroatoms (e.g., 

N, O or S atoms). The linker moieties, when other than a bond, will have from about 1 
to 60 atoms, usually 1 to 30 atoms, where the atoms include C, N, O, S, P, etc., 
particularly C, N and O, and will generally have from about 1 to 12 carbon atoms and 
from about 0 to 8, usually 0 to 6 heteroatoms. The number of atoms referred to above 
30 are exclusive of hydrogen in referring to the number of atoms in a group, unless 

indicated otherwise. 

15 
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[0053] Linker moieties may be varied widely depending on their function, 

including alkyleneoxy and polyalkyleneoxy groups, where alkylene is of from 2-3 
carbon atoms, methylene and polymethylene, polyamide, polyester, and the like, where 
individual monomers will generally be of from 1 to 6, more usually 1 to 4 carbon 
5 atoms. The oligomers will generally have from about 1 to 10, more usually 1 to 8 

monomelic units. The monomelic units may be amino acids, both naturally occurring 
and synthetic, oligonucleotides, both naturally occurring and synthetic, condensation 
polymer monomeric units and combinations thereof. 

[0054] The ligand portion permits capture of the conjugate of the target protein 

1 0 and the probe. The ligand may be displaced from the capture reagent by addition of a 

displacing ligand, which may be free ligand or a derivative of the ligand, or by 
changing solvent (e.g., solvent type or pH) or temperature or the linker may be cleaved 
chemically, enzymatically, thermally or photochemically to release the isolated 
materials (see discussion of the linker moiety, below). 

1 5 [0055] Examples of ligands, X, include, but are not limited to, detectable labels 

such as fluorescent moieties and electrochemical labels, biotin, digoxigenin, maltose, 
oligohistidine, 2,4-dintrobenzene, phenylarsenate, ssDNA, dsDNA, a polypeptide, a 
metal chelate, and/or a saccharide. Examples of ligands and their capture reagents also 
include but are not limited to: dethiobiotin or structurally modified biotin-based 

20 reagents, including deiminobiotin, which bind to proteins of the avidin/streptavidin 

family, which may, for example, be used in the forms of strepavidin- Agarose, 
oligomeric-avidin-Agarose, or monomeric-avidin-Agarose; any vicinal diols, such as 
1,2-dihydroxyethane (HO-CH 2 -CH 2 -OH), and other 1,2-dihyroxyalkanes including 
those of cyclic alkanes, e.g., 1,2-dihydroxycyclohexane which bind to an alkyl or aryl 

25 boronic acid or boronic acid esters, such as phenyl-B(OH) 2 or hexyl-B(OEthyl) 2 which 

may be attached via the alkyl or aryl group to a solid support material, such as 
Agarose; maltose which binds to maltose binding protein (as well as any other 
sugar/sugar binding protein pair or more generally to any ligand/ligand binding protein 
pairs that has properties discussed above); a hapten, such as the dinitrophenyl group, to 

30 which an antibody can be generated; a ligand which binds to a transition metal, for 

example, an oligomeric histidine will bind to Ni(II), the transition metal capture 
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reagent may be used in the form of a resin bound chelated transition metal, such as 
nitrilotriacetic acid-chelated Ni(IT) or iminodiacetic acid-chelated Ni(II); glutathione 
which binds to glutathione-S-transferase. For the most part, the ligands will be 
haptens that bind to a naturally occurring receptor, e.g. biotin and avidin, or an 
5 antibody or will be a detectable label, that is also a hapten. 

[0056] One may use chemical affinity resins, e.g. metal chelates, to allow for 

digestion of proteins on the solid phase resin and facilitate automation. One example 
of this is the use of immobilized nickel (II) chelates to purify peptides that have six 
consecutive histidine residues (His-6 tag) (as described in the Invitrogen product 

1 0 brochureProBond ™ Resin (Purification) Catalog nos. R80 1 -01 , R80 1-15 Version D 

000913 28-0076), which could be adapted to include non-peptidic chemical linkage 
coupling a series of imidazole-containing moieties. Alternative chemical attachments 
include phenyldiboronic acids (described in Bergseid, M. et al. Biotechniques (2000) 
29(5), 1 126-1 133), and disulfide reagents (described in Daniel, SM et al., 

1 5 Biotechniques (1998) 24(3), 484-489). Additionally, chemical affinity tags that are 

useful in combinatorial synthesis could be adapted for modified peptide purification 
(reviewed in Porco, J A (2000) Comb. Chem. High Throughput Screening 3(2) 93-102 

[0057] The term "fluorescent moiety" refers to a ligand that can be excited by 

electromagnetic radiation, and that emits electromagnetic radiation in response in an 

20 amount sufficient to be detected in an assay. The skilled artisan will understand that a 

fluorescent moiety absorbs and emits over a number of wavelengths, referred to as an 
"absorbance spectrum" and an "emission spectrum." A fluorescent moiety will exhibit 
a peak emission wavelength that is a longer wavelength than its peak absorbance 
wavelength. The term "peak" refers to the highest point in the absorbance or emission 

25 spectrum. 

[0058] The fluorescent moiety may be varied widely depending upon the 

protocol to be used, the number of different probes employed in the same assay, 
whether a single or plurality of lanes are used in the electrophoresis, the availability of 
excitation and detection devices, and the like. For the most part, the fluorescent 
30 moieties that are employed as ligands will absorb in the ultraviolet, infrared, and/or 

most preferably in the visible range and emit in the ultraviolet, infrared, and/or most 

17 



WO 02/071066 



PCT/US02/06234 



preferably in the visible range. Absorption will generally be in the range of about 250 
to 750 nm and emission will generally be in the range of about 350 to 800nm. 
Illustrative fluorescent moieties include xanthene dyes, naphthylamine dyes, 
coumarins, cyanine dyes and metal chelate dyes, such as fluorescein, rhodamine, 
5 rosamine, the BODIPY dyes (FL, TMR, and TR), dansyl, lanthanide cryptates, erbium, 

terbium and rutheniiim chelates, e.g. squarates, and the like. Additionally, in certain 
embodiments, one or more fluorescent moieties can be energy transfer dyes such as 
those described in Waggoner et al. 9 U.S. Patent no. 6,008,373. The literature amply 
describes methods for linking fluorescent moieties through a wide variety of linker 
10 moieties to other groups. The fluorescent moieties that find use will normally be under 

2kDal, usually under lkDal. 

[0059] Preferred fluorescent moieties can include elaborated conjugated pyran 

molecules, including xanthenes. Such molecules include eosin, erythrosin, fluorescein, 
Oregon green, and various commercially available Alexa Fluor ® dyes (Molecular 
15 Probes, Inc.). Structural examples of such dyes include: 
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[0060] Particularly preferred fluorescent moieties are the rhodamine dyes. 

These molecules typically have the general structure: 




[006 1 ] Where X is -C0 2 H, or -S0 3 H; Y is -H, -CH 3 , or together with R forms 

a six-membered ring; Z is -H or together with R forms a six-membered ring; and R is - 
H, -CH 3 , -CH2CH3, or together with Y or Z forms a six-membered ring. Rhodamine 
molecules such as tetramethylrhodamine, 5-carboxytetramethylrhodamine, 6- 
carboxytetramethylrhodamine, carboxyrhodamine-6G, rhodamine-B sulfonyl chloride, 
rhodamine-red-X, and carboxy-X-rhodamine are well known to those of skill in the 
art. See, e.g., Handbook of Fluorescent Probes and Research Products, Molecular 
Probes, Inc., 2001, which is hereby incorporated by reference in its entirety. 
Advantageous properties of rhodamines include high quantum yields, low sensitivity 
of fluorescence over a pH range of from about pH 3 to about pH 8, advantageous water 
solubility, good photostability, and absorption of light in the visible spectrum. 
Particularly preferred fluorescers are 5-carboxytetramethylrhodamine and 6- 
carboxytetramethylrhodamine. 

[0062] Other preferred fluorescent moieties include the 

BODIPY dyes, which are elaborations of a 4-bora-3a,4a-diaza-s-indacene structure. 
Exemplary structures are provided below: 
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H,C F " F (CH^COOH ^ \ 




(CH^COOH 



[0063] Yet other preferred fluorescent moieties include the cyanine dyes, 

conjugated structures comprising a polymethine chain terminating in nitrogen atoms. 
Typically, the nitrogens are themselves part of a conjugated heterocycle. An 
exemplary structures is provided below: 



HorHOaS 




SOaHorH 



COOH 



[0064] Also of interest for use as ligands are matched dyes as described in U.S. 

Patent No. 6,127,134, which is hereby incorporated by reference in its entirety, 
including all tables, figures, and claims, which is concerned with labeling proteins with 

1 0 dyes that have different emissions, but have little or no effect on relative migration of 

labeled proteins in an electrophoretic separation. Of particular interest are the cyanine 
dyes disclosed therein, being selected in '134 because of their positive charge, which 
matches the lysine to which the cyanine dyes bind. In addition there is the opportunity 
to vary the polyene spacer between cyclic ends, while keeping the molecular weight 

1 5 about the same with the introduction of an alkyl group in the shorter polyene chain dye 

to offset the longer polyene. Also described are the BODIPY dyes, which lack a 
charge. The advantage of having two dyes that similarly affect the migration of the 
protein would be present when comparing the native and inactived samples, although 
this would require that in the inactivated sample at least .a portion of the protein is 

20 monosubstituted. 
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[0065] In each of the foregoing examples of preferred fluorescent moieties, 

carboxyl groups can provide convenient attachment sites for linker moieties. In the 
particularly preferred 5- and 6-carboxyrhodamine molecules, the 5- or 6- carboxyl is 
particularly preferred as an attachment site: 



Z Z 




COOH 



[0066] In general, any affinity label-capture reagent commonly used for 

affinity enrichment, which meets the suitability criteria discussed above, can be used 
in the method of the invention. Biotin and biotin-based affinity tags are particularly 
illustrated herein. Of particular interest are structurally modified biotins, such as 
deiminobiotin or dethiobiotin, which will elute from avidin or streptavidin 
(strept/avidin) columns with biotin or under solvent conditions compatible with ESI- 
MS analysis, such as dilute acids containing 10-20% organic solvent. For example, 
deiminobiotin tagged compounds will elute in solvents below about pH 4. 

[0067] Design of ABPs and libraries of ABPs 

[0068] ABPs may be designed and synthesized using combinatorial chemistry 

and/or rational design methods. A detailed description of an ABP design strategy, in 
which a fluorescent moiety can act as a ligand, is provided in PCT Application No. 

, entitled "Activity Based Probe Analysis" (Attorney Docket No. 063391- 

0202), filed February 5, 2002, PCT Application No. PCT/US00/34187, WO 01/77684, 
entitled "Proteomic Analysis" and PCT Application No. PCT/US00/34167, WO 
01/77668, entitled 'Troteomic Analysis," each of which is hereby incorporated by 
reference in its entirety, including all tables, figures, and claims. As described therein, 
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goals of a design strategy are to provide ABPs that are able to react covalently with a 
targeted group of active proteins, while minimizing non-specific labeling. 

[0069] One strategy that may be utilized to design ABPs is to first identify a 

potentially reactive amino acid that is conserved at the sequence level in the region 
5 that is targeted for ABP labeling, and to select an appropriate functional group for 

attachment to an appropriate affinity moiety (e.g., an adenosine analogue for ATP 
binding proteins). Potential reactive amino acids include serine, threonine, tyrosine, 
lysine, arginine, aspartic acid, glutamic acid, asparagine, glutamine, methionine, and 
cysteine. One can also consider the composition of the linker moiety between the 

10 warhead moiety and the ligand of the ABP, as this can affect the selectivity and 

specificity of the resulting ABPs. Linker moieties may be either obtained 
commercially (see, e.g., Pierce Chemical Company Catalog and Handbook 1994-95, 
pages O-90 through O-l 10, which is hereby incorporated by reference) or synthesized 
as needed. A library of molecules comprising, for example, linker chemistries 

15 exhibiting varying lengths, hydrophobicities, etc., may be constructed. Moreover, the 

library of ABPs can also be expanded by varying identity of the ligand (e.g., using a 
variety of different fluorescent moieties), and/or the location of linker moiety 
attachment point on the ligand (e.g., 5-TMR linkage vs. 6-TMR linkage), as these can 
also affect the selectivity and specificity of the resulting ABPs. 

20 [0070] In the case of a combinatorial library, as indicated above, numerous 

variations as to the structure of the ABP can be prepared. These various members may 
then be screened with a complex protein mixture to determine which members of the 
library are inactive with inactivated target protein(s), but react with active target 
protein(s). In carrying out the subject methodology, one or a plurality of ABPs may be 

25 added to a complex protein sample as described herein. 

[0071] In an alternative design strategy, a functional group may be selected 

that imparts an element of "chemical specificity" to the ABP. In these embodiments, 
the requirement for an affinity moiety to achieve adequate probe selectivity can be 
eliminated. The incorporation of an affinity moiety into an ABP can affect the number 

■ 

30 of proteins targeted by a probe. Thus, depending on the number of target proteins of 

interest and the similarity of active site binding selectivity, the artisan can choose to 
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include the affinity moiety or not, as required. For example, a fluorophosphonate 
reactive group provides a classical affinity label for serine hydrolases that selectively 
reacts with the activated serine nucleophile of catalytic triad and other serine hydrolase 
classes. 

5 [0072] While the number of ABPs that can be used is theoretically unlimited, 

preferably not more than about 20 are used. Since the sources of the protein mixture 
will vary widely and one biomolecule may have an influence on the activity of the 
ABP, as well as on the reactivity of a protein, the mixture may be subject to dilution, 
fractionation, precipitation, extraction, dialysis, chromatography or other processing to 
10 obtain the desired composition. For the most part, the composition will not be 

significantly modified, maintaining substantially the composition obtained from the 
source. In some instances the pH may be modified, solvents added, or the like, to 
enhance the reaction of the active target proteins with the ABP(s) or change the ABP 
profile as to the active target proteins. 

15 r00731 Analysis of Complex Protein Mixtures 

[0074] The methods of the present invention can be divided generally into two 

classes, referred to herein for convenience as "screening" methods and "identification" 
methods. 

[0075] "Screening" refers to methods in which one or more complex protein 

20 mixtures suspected of containing one or more target proteins are mixed with one or 

more ABPs, whereby active target proteins present in the complex protein mixture(s) 
are labeled. The proteins are subsequently digested using one or more proteases to 
generate ABP-labeled peptides. The ABPs of the present invention preferably react 
with and bind to a single residue on an active target protein, and preferably a single 
25 ABP-labeled peptide is generated from each labeled active target protein. A signal is 

then generated from the ABP-labeled peptide(s), which can be correlated to the 
presence or amount of labeled active target proteins in the original complex protein 
mixture, preferably following a separation method (e.g., slab gel or capillary 
electrophoresis, or liquid chromatography). In these methods, a capture (or 
30 sequestration) step (as described hereinafter) is optional. 
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[0076] In contrast, "identification" refers to methods in which the molecular 

weight and/or the sequence of one or more ABP-labeled peptides, generated as 
described in the preceding paragraph, are determined by mass spectroscopy ("MS"). In 
these methods, a capture (sequestration) step is typically performed to purify ABP- 
5 labeled peptide(s) prior to MS analysis. 

[0077] The terms "mass spectrometry" or "MS" as used herein refer to 

methods of filtering, detecting, and measuring ions based on their mass-to-charge 
ratio, or "m/z." In general, one or more molecules of interest are ionized, and the ions 
are subsequently introduced into a mass spectrographs instrument where, due to a 

10 combination of magnetic and electric fields, the ions follow a path in space that is 

dependent upon mass ("m") and charge ("z"). See, e.g., U.S. Patent Nos. 6,204,500, 
entitled "Mass Spectrometry From Surfaces;" 6,107,623, entitled "Methods and 
Apparatus for Tandem Mass Spectrometry;" 6,268,144, entitled "DNA Diagnostics 
Based On Mass Spectrometry;" 6,124,1 37, entitled "Surface-Enhanced Photolabile 

15 Attachment And Release For Desorption And Detection Of Analytes;" Wright et aL, 

"Proteinchip surface enhanced laser desorption/ionization (SELDI) mass spectrometry: 
a novel protein biochip technology for detection of prostate cancer biomarkers in 
complex protein mixtures," Prostate Cancer and Prostatic Diseases 2: 264-76 (1999); 
and Merchant and Weinberger, "Recent advancements in surface-enhanced laser 

20 desoxption/ionization-time of flight-mass spectrometry," Electrophoresis 21:11 64-67 

(2000), each of which is hereby incorporated by reference in its entirety, including all 
tables, figures, and claims. Molecules (e.g., peptides) in a test sample can be ionized 
by any method known to the skilled artisan. These methods include, but are not limited 
to, electron ionization, chemical ionization, fast atom bombardment, field desorption, 

25 and matrix-assisted laser desorption ionization ("MALDI"), surface enhanced laser 

desorption ionization ("SELDI"), photon ionization, electrospray, and inductively 
coupled plasma. 

[0078] As used herein, reference to a "screening" method is not intended to 

imply that a given peptide, and its source protein, cannot be identified in such 
30 methods. For example, a particular peptide may exhibit a characteristic migration rate 

in a separation method that can be used to identify the peptide. Alternatively, several 

• * # 
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separation methods and/or conditions may be employed, and the separation profile of 
migration patterns generated for a given peptide can be used to identify the peptide. 
Instead, the distinction between a screening method and an identification method is the 
use of MS to directly measure molecular weight in identification methods. 
5 Additionally, the skilled artisan will understand that a single complex protein mixture 

may be analyzed by both screening and identification methods, as exemplified in 
Figure 2. 

[0079] The terms "separation" and "separating" as used herein refer to 

methods that enrich the concentration of a molecule of interest in a particular location 

10 or container relative to other molecules originally present. For example, gel 

electrophoresis enriches the concentration of molecules that migrate at a particular rate 
relative to other molecules originally present that migrate at different rates. Numerous 
additional analytical procedures are known to the artisan for separating and analyzing 
complex protein mixtures (e.g., chromatographic methods such as HPLC, FPLC, ion 

15 exchange, size exclusion; mass spectrometry; differential centrifugation). The term 

separating does not indicate that a desired component is obtained in a "pure" form; 
only that the desired component has increased in abundance relative to the other 
components present. For example, a complex protein mixture may be separated by 
electrophoresis, whereby a desired component is separated into one or more bands or 

20 spots present on a gel. Other, undesired, components may be present in the same band 

or spot; nonetheless, the desired component will have increased in abundance in that 
band or spot. A separation profile obtained from a separation procedure can be 
expressed as an elution time or retention time using a chromatography column, a 
migration distance, Rf, migration time, or elution time using an electophoresis 

25 apparatus, or any other expression commonly used by one of skill in the art to 

distinguish between separated components. 

[0080] A particular type of separating method is referred to herein as "capture" 

or "sequestration." These methods enrich the concentration of molecules capable of 
being sequestered (e.g., by binding to one or more receptors) relative to other 
30 molecules not so capable (e.g., removed by washing out molecules that do not bind to 

a receptor). In preferred embodiments, one or more molecules are sequestered by 
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being bound to a receptor that is itself bound to a surface. Following removal of 
unbound molecules, the sequestered molecules may be released from the receptor(s) 
for further processing- As used herein with respect to receptors, the phrases 
"specifically binds" and "specific binding" refers to a molecule or molecular complex 
5 (e.g., an antibody or binding fragment thereof) that binds to its intended ligand(s) (e.g., 

a cognate antigen) with at least a 2-fold greater affinity compared to the binding of the 
receptor to molecules other than the intended ligand(s). In preferred embodiments, 
such a receptor binds to binding partners with at least a 5 -fold, more preferably at least 
a 10- fold, even more preferably at least a 100-fold, and most preferably at least a 
10 1000-fold greater affinity compared to non : partner binding. In addition to antibody- 

antigen pairs, suitable receptor-ligand pairs include but are not limited to receptor- 
hormone pairs, avidin-biotin pairs, streptavidin-biotin pairs, and metal-chelate pairs. 

[008 1 ] Labeling of active target proteins by ABPs 

[0082] In carrying out both screening and identification methods, one or a 

15 plurality of ABPs will be added to a complex protein mixture as described herein, 

where the ABPs will react with the active target proteins present. The complex protein 
mixtures may come from different sources and be used for different purposes. This 
may include a relatively pure sample of the enzyme to determine the activity in 
relation to total protein of the sample. The sample may be a single cell or a mixture of 

20 cells, a neoplastic sample or other biopsy or tissue comprising a single cell type or a 

mixture of cell types, such as tissue from an organ, e.g. heart, lung, esophagus, kidney, 
brain, blood, etc., diseased tissue or healthy tissue, etc. The cells may be prokaryotic 
or eukaryotic, vertebrate or non- vertebrate, particularly mammalian and more 
particularly human. The cells or tissues, or lysates thereof, may be prepared in a 

25 variety of ways, including fractionation, using chromatography, centrifugation, 

precipitation, fractionation, fluorescence activated cell sorting, dilution, dialysis, 
concentration, etc. The sample will usually be treated so as to preserve the activity of 
the target protein(s), so that the manner of treatment will be mild, ambient or lower 
temperatures will be used, particularly below 37°C, and other denaturing conditions 

30 will be avoided, such as organic solvents, detergents or high salts. 
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[0083] Generally, the amount of each ABP will be sufficient to react with all of 

the active target protein for the ABP expected to be in the proteome. Typically the 
amount of each ABP is present in excess over its target proteins on a molar basis. After 
incubating the reaction mixture, generally for a time for the reaction to go substantially 
5 to completion, usually not more than about 2h, generally for about 0. 1 - 60min, at a 

temperature in the range of about 20 - 40°C, the reaction may be quenched. Since the 
extent of the reaction will increase with time, the longer the time, the more cross- 
reactivity may be anticipated. A preferred time will be employed to provide the most 
favorable results, that is, the greatest level of reaction with the target proteins with the 
1 0 least cross-reactivity. 

[0084] To enhance the accuracy of the quantitation, internal standards can be 

employed, where known proteins are reacted with one or more ABPs to provide one or 
more conjugates of known composition. These internal standards can be used to 
account for sample losses during processing, account for variations in protein digest 
15 efficiencies, and/or account for variations in relative migration times in separation 

procedures. A predetermined amount of these ABP-labeled standards may be added to 
a sample, so that the sample will have a known concentration of the standard. The 
internal standard is then subjected to the same processes as the component of the 
sample. 

* 

20 [0085] Where the procedure is substantially reproducible, an internal standard 

may be a protein that is also present in the sample being analyzed. One would expect 
to obtain a predictable signal from the standard, and any increase over this amount can 
be attributed to active target protein present in the sample. Alternatively, the internal 
standard may be selected to be different from any conjugate in the sample. In 

25 particularly preferred embodiments, one or more internal standards can employ ligands 

(e.g., fluorescent moieties) that are differentially detectable in comparison to those 
used for labeling the complex protein mixture. This can be particularly advantageous 
in separation methods, such as electrophoresis or chromatography, as standard signals 
can be easily distinguished from signals obtained from the labeled active target 

30 proteins. 
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[0086] One or more internal standards may also be selected that provide, for 

example, a proteolysis site known to react slowly with the protease(s) being employed, 
and preferably selected to react more slowly than one or more proteins of interest 
present in the complex protein mixture. Such standards can be used to monitor the 
5 course of hydrolysis. 

[0087] Digesting labeled active target proteins 

[0088] Following ABP labeling of active target proteins, both screening and 

identification methods utilize protein digestion to produce ABP-labeled peptides. The 
digestion may be performed while the proteins are in solution or when the conjugates 

10 are sequestered, e.g., by receptors bound to a solid support. Digestion preferably 

employs only one protease; however, two or more, usually not more than three, 
proteases may be used. The proteases may be in solution or bound to a surface. The 
proteases may be combined in the same reaction mixture, or the sample may be 
divided into aliquots and each of the aliquots treated with a different protease. 

15 Digestion may also occur before binding to the conjugate to a support and/or a after 

the conjugates are bound to a solid support. Enzymes that find use include, but are not 
limited to, trypsin, chymotrypsin, bromelain, papain, carboxypeptidase A, B and Y, 
proteinase A and K, chymopapain, plasmin, subtilisin, clostripain etc. 

[0089] In particularly preferred embodiments, additional steps can be used to 

20 reduce the complexity of the analysis to be performed. For example, the complex 

protein mixture can be denatured following labeling, e.g., by the addition of urea, 
guanidinium salts, detergents, organic solvents, etc., in order to reduce or eliminate 
unwanted proteolysis from endogenous proteases present in the mixture. Additionally, 
cysteine residues can be reduced and alkylated to maintain the homogeneity of 

25 cysteine-containing peptides and to prevent refolding of endogenous proteases 

*\ 

following removal of the denaturant. Moreover, proteases can be combined with 
additional enzymes, such as glycosidases, phosphatases, sulfatases, etc., that can act to 
remove post-translational modifications from proteins. Examples of such post- 
firanslational modifications include, but are not limited to, glycosylations, 
30 phosphorylations, sulfations, prenylations, methylations, amidations, and 
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myristolations. Such steps can be mixed and matched by the skilled artisan, depending 
, on the requirements of a particular analysis. 

[0090] Prior to digestion, a buffer exchange step may be employed, e.g., by gel 

filtration, dialysis, etc. This step may be used to remove excess ABPs, to remove 
5 denaturant, and/or to provide suitable buffer conditions for digestion. In particularly 

preferred embodiments, buffer exchange is performed by gravity flow gel filtration. 

[0091] Digestion will be carried out in an aqueous buffered medium, generally 

at a pH in the range of about 4 to 10, depending on the requirements of the protease. 
The concentration of the protease will generally be in the range of about 6 x 10' 8 M to 

10 about 6 x 10" 6 M, more preferably in the range of about 1 .8 x 10~ 8 M to about 2 x 10' 7 

M, and most preferably about 6 x 10" 7 M (e.g., 150 ng / 10 uJL). The term "about" in 
this context means +/- 10% of a givem measurement. The time for the digestion will 
be sufficient to go to at least substantial completion, so that at least substantially all of 
the protein will have been digested. Digests may be performed at a temperature that is 

1 5 compatible with the protease(s) employed, preferably from 20°C to 40°C, most 

preferably about 37°C. Where the digestion takes place in solution, the protease may 
be quenched by any convenient means, including heating or acidification of the 
sample. Alternatively, quenching can be achieved by sequestering the fragment 
conjugates with a receptor for the ligand bound to a surface, or by addition of a 

20 protease inhibitor (e.g., E64, DIFP, PMSF, etc.). Where the proteins are bound to a 

surface, the proteases may be washed away before the bound digested protein is 
released. 

[0092] Sequestration of ABP-labeled peptides 

[0093] It is at this point that screening methods and identification methods 

25 typically diverge. Following protein digestion, ABP-labeled peptides can be 

sequestered, e.g., by binding to receptors for the ligand of one or more ABP(s) 
employed during the labeling step. This sequestration step is typically not employed in 
screening methods, as subsequent separation steps (e.g., electrophoresis, 
chromatography) and detection steps (e.g., generation of fluorescence from fluorescent 
30 ABPs) employed in screening methods are often sufficient to distinguish the various 
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ABP-labeled peptides (sequestration may be employed in screening methods when 
advantageous, such as when a sample is very dilute and sequestration can be used to 
concentrate ABP-labeled peptides). In contrast, such sequestration step(s) are 
typically employed in identification methods in order to reduce the number of peptides 
5 injected into the MS instrument, thereby simplifying MS analysis. 

[0094] Preferably, sequestration relies on receptors bound to a solid support 

that can be easily manipulated during wash steps. The support may be beads, including 
paramagnetic beads, prepared from various materials, such as Bioglas, polystyrene, 
polyacrylate, polymethylmethacrylate, polyethylene, polysaccharides, such as 
10 Agarose, cellulose, amylose, etc., polyurethane, and the like. Desirably, the support 

surface will not interfere with the binding of ligand to its cognate receptor, and the 
receptor may be linked to the support by a hydrophilic bridge that allows for the 
receptor to be removed from the surface. When beads are employed, the beads will 
generally have a cross-dimension in the range of about 5 to 100|j.. Instead of beads, 
15 one may use solid supports, such as slides, the walls of vessels, e.g. microtiter well 

walls, capillaries, etc. There is an extensive literature of receptor bound supports that is 
readily applicable to this invention, since the sequestering step is conventional. The 
sample is contacted with the support for sufficient time, usually about 5 to 60min, to 
allow all of the conjugate to become bound to the surface. At this time, all of the non- 
20 specifically bound components from the sample may be washed away, greatly 
enriching the target proteins as compared to the rest of the sample. 

[0095] Following separation by sequestration, ABP-labeled peptides may then 

be released from the. receptor. The particular method of release will depend upon the 
ligand-receptor pair. In some instances, one may use an analog of the ligand as a 

25 "releasing agent" to release the conjugate. This is illustrated by the use of deimino- or 

dethiobiotin as the ligand and biotin as the releasing agent. Where this is not 
convenient, as in the case of many fluorescent moieties as ligands where there may not 
be a convenient analog, conditions such as high salt concentrations, chaeotropic agents 
(e.g., isothiocyanate or urea) low pH, detergents, organic solvents, etc., may be used to 

30 effect release. Once the conjugate has been released, dialysis, ion exchange resins, 



* 



30 



WO 02/071066 



PCT/US02/06234 



precipitation, or the like may be used to prepare the conjugate solution for the next 
stage. 

[0096] "Screening" analysis methods 

[0097] In screening methods, a signal is then generated from the various ABP- 

5 labeled peptides obtained from the original complex protein mixture. If only a single 

ABP-labeled peptide is associated with a particular signal (e.g., if each ABP reacts 
with a single target protein and each ABP is distinguishable) a signal may be generated 
without further separation. More typically, however, one or more separation methods 
are employed to separate various ABP-labeled peptides prior to signal generation. 

10 [0098] These separation methods include, but are not limited to, liquid 

chromatography, particularly reverse phase liquid chromatography, where the 
components are separated by their differences in hydrophobicity, or electrophoresis, 
particularly slab gel or capillary electrophoresis. The electrophoresis may involve 
one- or two-dimensional electrophoresis, may be in a gel, may use a capillary or may 

15 use a channel in a microfluidic device. See e.g., Opiteck, et ah, Anal. Chem. (1998) 

258:349-61; U.S. Patent Nos. 4,415,655; 4,481,094; 4,865,707; and 4,946,794; 
Laemmli, UK, Nature (1970) 227, 680-685; and Sambrook, J.; MacCallum, P. & 
Russell, D. (2001) "Molecular Cloning: A Laboratory Manual." 3 rd Edition, Cold 
Spring Harbor Press, Cold Spring Harbor, NY.. Liquid chromatography may use a 

20 combination of size exclusion liquid chromatography followed by RP-HPLC or only 

RP-HPLC. The conditions employed are conventional for liquid chromatographic 
separation of proteins and peptides and commercial equipment and materials are 
available. See, e.g., U.S. Patent Nos. 5,041,538 and 5,290,920 and W091/15228, as 
exemplary. A suitable eluant can include a water/acetonitrile gradient, optionally 

25 containing 0. 1 % trifluoroacetic acid; 0. 1 % trifluoroacetic acid; or 0. 1 % formic acid. 

The conjugates can be monitored by their fluorescence and may be isolated in wells 
for further investigation. A separation profile in such methods may be sufficient 
information to identify the peptide and, therefore, the protein. 

[0099] For screening, a particularly preferred separation/signal generation^ 

30 method is capillary electrophoresis with laser-induced fluorescence ("CE-LIF"); 
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Capillary electrophoresis methods are well known to the skilled artisan. See, e.g., 
Kasicka, "Recent adyafices in capillary electrophoresis of Peptides," Electrophoresis 
(2001) 22:4139-62; Sanchez and Smith, "Capillary electrophoresis," Methods 
EnzymoL (1997) 289:469-78; Xu, "Capillary electrophoresis," Anal. Chem. (1995) 
5 67(12):463R-473R; Landers, James (Ed.) Handbook of Capillary Electrophoresis, 2nd 

Ed. CRC Press, 1997. The resolution of CE peptide separations can be up to 10-fold 
higher in comparison to reverse phase HPLC, and single molecule detection can be 
achieved when coupled to LIF. Additionally, multichannel detectors and multicapillary 
(e.g., 96 capillary) instruments permit multiplexed, high throughput analyses. An 
10 exemplary configuration of a CE-LIF multichannel detection instrument is shown in 

Figure 1. 

[0100] A signal can then be generated from the ABP-labeled peptide(s) 

present, and, if appropriate, the intensity compared from two or more samples. 
Detection and measurement can be achieved with a CCD, photomultiplier, etc., where 

15 the information may be transferred to a data processor for analysis. The different 

components of the samples and their relative amounts as measured by the intensity of 
their emissions can be analyzed by the data processor and a profile obtained that 
compares the number of components and the ratios between two or more samples, 
and/or results can be compared to other profiles for comparison. In this way, one 

20 obtains the characteristics of the complex protein mixture as it is affected by changes 

in the cells due to differentiation, maturation, cell type and changes in the cellular 
environment. 

[0101] "Identification" analysis methods 

[0102] As discussed above, where the migration rates in the separation stages 

25 of a screening method provide the necessary identification of the ABP-labeled 

peptide(s) and, therefore, the protein, and provide quantitation of the protein, no 
further analysis is required. However, where further confirmation of the results from 
the earlier analysis is desired or the earlier results do not provide certainty as to the 
identification and amount of a particular component, an identification method using 
30 mass spectrometry (MS) can be employed. See, for example, WO 00/1 1208. The use 

of mass spectrometry will be described below. Such identification methods potentially 
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provide greater information, but requires greater sample size in comparison to, for 
example, capillary electrohoresis, and has a lower throughput. 

[01 03] In identification methods, chromatographic and/or electrophoretic 

separation methods as described herein may be used to simplify the mixtures 
5 introduced into the mass spectrometer, allowing for a more accurate analysis. These 

separation methods may be employed prior to, following, or in lieu of sequestration of 
the ABP-labeled peptides described above. The use of fluorescent moieties as ABP 
ligands can permit the use of an online fluorescence detector to trigger ESI-MS data 
collection or fraction collection for subsequent analysis, e.g., providing sample on a 
10 MALDI plate. In this way, only fractions and bands that contain ABP-labeled peptides 

will be selected for further processing, thereby avoiding using the MS with certain 
fractions. 

[0104] In particularly preferred embodiments, the identification methods 

described herein can be combined with one or more separation methods to develop a 

1 5 "separation profile" that can be used to identify ABP-labeled peptides without the need 

for MS analysis. In these methods, a sample (e.g., material from a chromatography 
column) is divided into at least two portions; one portion is used for MS analysis, and 
the other portion(s) are used for one or more separation methods (e.g., a single CE run, 
or two or more CE runs using different separation conditions). The peptide 

20 identification obtained from the MS analysis can be assigned to the observed 

separation profile (e.g., the elution time of the peptide observed in the CE run(s)). 
Observation of this separation profile in subsequent samples can then be correlated to 
the peptide known to exhibit that separation profile. 

[0105] The identification methods described herein may also utilize ABPs that 

25 differ isotopically in order to enhance the information obtained from MS procedures. 

For example, using automated multistage MS, the mass spectrometer may be operated 
in a dual mode in which it alternates in successive scans between measuring the 
relative quantities of peptides obtained from the prior fractionation and recording the 
sequence information of the peptides. Peptides may be quantified by measuring in the 

i 

30 MS mode the relative signal intensities for pairs of peptide ions of identical sequence 

that are tagged with the isotopically light or heavy forms of the reagent, respectively, 
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and which therefore differ in mass by the mass differential encoded with the ABP. 
Peptide sequence information may be automatically generated by selecting peptide 
ions of a particular mass-to-charge (m/z) ratio for collision-induced dissociation (CID) 
in the mass spectrometer operating in the MS n mode. (Link, et al., (1997) 
5 Electrophoresis 18:1314-34; Gygi, et al., (1999) idid 20:310-9; and Gygi et al., (1999) 

Mol. Cell. Biol. 19:1720-30). The resulting CID spectra may be then automatically 
correlated with sequence databases to identify the protein from which the sequenced 
peptide originated. Combination of the results generated by MS and MS n analyses of 
affinity tagged and differentially labeled peptide samples allows the determination of 
10 the relative quantities as well as the sequence identities of the components of protein 

mixtures. 

[01 06] Protein identification by MS n may be accomplished by correlating the 

sequence contained in the CID mass spectrum with one or more sequence databases, 
e.g., using computer searching algorithms (Eng. et al. (1994) J. Am. Soc. Mass 

15 Spectrom. 5:976-89; Mann, et al., (1994) Anal. Chem. 66:4390-99; Qin, et al., (1997) 

ibid 69:3995-4001 ; CJauser, et al., (1995) Proc. Natl. Acad. Sci. USA 92:5072-76). 
Pairs of identical peptides tagged with the light and heavy affinity tagged reagents, 
respectively (or in analysis of more than two samples, sets of identical tagged peptides 
in which each set member is differentially isotopically labeled) are chemically 

20 identical and therefore serve as mutual internal standards for accurate quantitation. The 

MS measurement readily differentiates between peptides originating from different 
samples, representing different cell states or other parameter, because of the difference 
between isotopically distinct reagents attached to the peptides. The ratios between the 
intensities of the differing weight components of these pairs or sets of peaks provide 

25 an accurate measure of the relative abundance of the peptides and the correlative 

proteins because the MS intensity response to a given peptide is independent of the 
isotopic composition of the reagents. The use of isotopically labeled internal standards 
is standard practice in quantitative mass spectrometry (De Leenheer, et al., (1992) 
Mass Spectrom. Rev. 1 1:249-307). 

• » 

30 [0107] The ABP-labeled peptides may provide specific fragmentation patterns 

in MS procedures. In this manner, the fragmentation pattern of the ABP-labeled 
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peptides will aid in the identification of the proteins, or identifying which peptide(s) 
from a protein are labeled by an ABP. As an added advantage, where the ABP is 
positively charged, it will enhance the signal due to its easier ionization. 

[01 08] Isotopically distinguishable ABPs are useful when using MS detection 

5 for either the second dimension of separation, as in MALDI/TOF or when doing 

identification as in MS n . Convenient isotopic labels are 2 H, ,3 C, 15 N, ,7 0, ,s O or ^S. 
The use of the isotopically labeled ABPs also allows for the use of isotopically 
differing internal standards. The internal standard ABP will typically be otherwise 
identical to the sample ABP. 

10 [0109] The above procedures allow for analysis of complex protein mixtures 

such as proteomes. The analysis can be associated with screening of candidate 
compounds to determine the effect of the compound on the regulation of the target 
proteins, identification of the pathways that the candidate compound affects and the 
cellular response to the candidate compound. In this way the effectiveness of drugs 

15 may be analyzed, cross-reactivity determined, toxicity evaluated and other effects on 

cell physiology determined. Besides candidate compounds, the subject analysis may 
be used with natural products, environmental sample screening, and the like. In each 
case one can identify specific proteins that are affected by the environment of the cells 
and compare the effect of the environment with changes observed with other 

20 environments. The subject methods can be used in high throughput primary or 

secondary screening, where candidate compounds are evaluated for their efficacy and 
cross-reactivity, their influence on physiologic pathways and their interactions with 
other drugs that may be taken for the same indication. 

[0110] Kits can be provided that combine one or more ABPs for specific 

25 and/or related groups of proteins, with the proteolytic enzymes, in solution, but usually 

either as a lyophilized or bound to a support. Also included can be solid supports for 
the sequestration of the conjugates, both intact and fragment, where the supports may 
be any solid support, such as particles and beads of from about 5 to lOO^i, the walls of 
vessels, such as the microtiter wells of microtiter plates, capillaries, etc., to which the 
30 receptor is bound. The solid supports may be treated with an innocuous protein, such 
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as serum albumin, to occupy hot spots and inhibit non-specific binding. Pre-labeled 
internal standards may also be supplied. 

[0111] The following examples are offered by way of illustration and not by 

way of limitation. 

5 [01121 Example 1. Preparation of FP-PEG-TMR (fluorophosphonate- 

tetraethvleneglvcol-tetramethvlrhodarnine) 

[01 13] Compound la is the starting material tetraethyleneoxy (3,6,9-oxa-l,l 1- 

diolundecane) and compound lb is the starting material decylene-l,10-diol as depicted 
in the flow chart in Figure 1. 

10 [01 14] Compound 2. A solution of 1 (3.9 g, 20.0 mmol, 3.0 equiv) in DMF 

(8.0 mL) was treated with TBDMSC1 (1.0 g, 6.64 mmol, 1.0 equiv) and imidazole (0.9 
g, 13.3 mmol, 2.0 equiv) and the reaction mixture was stirred for 12 h at room 
temperature. The reaction mixture was then quenched with saturated aqueous 
NaHC03 and partitioned between ethyl acetate (200 mL) and water (200 mL). The 

15 organic layer was washed with dried (Na2S04) and concentrated under reduced 

pressure. Chromatography (Si02, 5x15 cm, 50-100% ethyl acetate-hexanes) 
afforded 2 (1.1 g, 2.0 g theoretical, 55%) as a colorless oil: 1H NMR (CDC13, 400 
MHz) 8 3.8-3.5 (m, 16H, CH20R), 0.88 (s, 9H, CH3C), 0.0 (s, 6H, CH3Si). 

[01 15] Compound 3. A solution of 2 (0.61 g, 2.0 mmol, 1 .0 equiv) in benzene 

20 (15 mL, 0. 1 3 M) was treated sequentially with PPh3 (2.6 g, 1 0.0 mmol, 5 equiv), 12 

(2.3 gj 9.0 mmol, 4.5 equiv), and imidazole (0.7 g, 10.3 mmol, 5.2 equiv) and the 
reaction mixture was stirred at room temperature for 30 min, producing a yellow- 
orange heterogeneous solution. The soluble portion of the reaction mixture was 
removed and the insoluble portion washed several times with ethyl acetate. The 
25 combined reaction and washes were then partitioned between ethyl acetate (200 mL) 

and saturated aqueous Na2S203 (200 mL). The organic layer was washed 
sequentially with H20 (100 mL) and saturated aqueous NaCl (100 mL), dried 
(r*Ta2S04), and concentrated under reduced pressure. Chromatography (Si02, 5x15 
cm, 5-25% ethyl acetate-hexanes) afforded 3 (0.54 g, 0.82 g theoretical, 66%) as a 
30 colorless oil: 1H NMR (CDC13, 400 MHz) 5 3.85-3.60 (m, 12H, CH20R), 3.54 (t, J = 
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5.6, 2H, CH20TBDMS), 3.23 (t, J = 7.0 Hz, 2H, CH2I), 0.88 (s, 9H, CH3C), 0.0 (s, 
6H, CH3Si). 

[01 16] Compound 4. Triethylphosphite (1 .2 mL, 7.0 mmol, 5.4 equiv) was 

added to 3 (0.53 g, 1.29 mmol, 1.0 equiv) and the mixture was stirred at 150°C for 1 h. 
5 The reaction mixture was cooled to room temperature and directly submitted to flash 

chromatography (Si02, 5 x 15 cm, 100% ethyl acetate) to afford 4 (0.43 g, 0.54 g 
theoretical, 80%) as a colorless oil: 1HNMR (CDC13, 400 MHz) 5 4.20-4.05 (m, 4H, 
CH3CH20P), 3.80-3.55 (m, 14H, CH20R), 2.15 (m, 2H, CH2P), 1.31 (t, J = 6.0 Hz, 
6H, CH3CH20P), 0.88 (s, 9H, CH3C), 0.0 (s, 6H, CH3Si). 

10 [01 17] Compound 5. A solution of compound 4 (0.21 g, 0.5 mmol, 1.0 equiv) 

in CH2C12 (2.8 mL, 0.18 M) was treated with HF-pyridine (0.084 mL, -0.84 mmol, 
-1 .7 equiv). The reaction was stirred at 25°C for 30 min and then partitioned between 
ethyl acetate (100 mL) and water (100 mL). The organic layer was dried (Na2S04) 
and concentrated under reduced pressure. Chromatography (Si02, 2x8 cm, 3-10% 

15 CH30H-CH2C12) afforded 5 (0.050 g, 0.28 g theoretical, 32.5%) as a clear oil: 1H 

NMR (CDC13, 400 MHz) 5 4.20-4.05 (m, 4H, CH3CH20P), 3.80-3.55 (m, 14H, 
CH20R), 2.15 (m, 2H, CH2P), 1.31 (t, J = 6.0 Hz, 6H, CH3CH20P); MALDI-FTMS 
m/z 337.1377 (C12H2707P + Na+ requires 337.1387). 

■ 

[0118] Compound 6. A solution of 5 (0.030 g, 0.096 mmol, 1 .0 equiv) in DMF 

20 (0.28 mL, 0.34 M) was treated sequentially with N,N-disuccinimidyl carbonate (0.058 

g, 0.22 mmol, 2.2 equiv) and triethylamine (0.035 uL, 0.25 mmol., 2.5 equiv). The 
reaction mixture was stirred at room temperature for 12 h and then partitioned between 
CH2C12 (100 mL) and H20 (100 mL). The organic layer was washed with saturated 
aqueous NaCl (100 mL), dried (Na2S04), and concentrated under reduced pressure. 
25 Chromatography (Si02, 2 x 8 cm, 1-10% CH30H-CH2C12) afforded 50.035 g, 0.043 

g theoretical, 81%) as a clear oil: 1H NMR (CDC13, 400 MHz) 8 4.45 (m, 2H, 
CH20C(0)OR), 4.20-4.05 (m, 4H, CH3CH20P), 3.80-3.55 (m, 12H, CH20R), 2.84 
(s, 4H, CH2C(0)N), 2.15 (m, 2H, CH2P), 1.31 (t, J = 6.0 Hz, 6H, CH3CH20P). 
MALDI-FTMS m/z 478.1456 (C17H30NO1 IP + Na+ requires 478.1449). 



37 



WO 02/071066 



PCT/US02/06234 



[0119] Compound 7. A solution of 6 (0.020 g, 0.044 mmol, 1 .0 equiv) in 

CH2C12 (0.14 mL, 0.40 M) was cooled to 0°C and treated with oxalyl chloride (0.082 
mL, 2M in CH2C12, 0.164 mM 3.7 equiv). The reaction mixture was allowed to warm '■ 
to room temperature and stirred for 1 8 h. The reaction mixture was then concentrated 
5 under a stream of gaseous nitrogen and the remaining residue treated with H20 (0. 1 

mL) for 5 min. The H20 was evaporated under a stream of gaseous nitrogen and the 
remaining residue dried by vacuum to provide 7 (0.015 mg, 0.019 mg theoretical, 
80%) as a clear oil/film: 1H NMR (CDC13, 400 MHz) 8 4.45 (m, 2H, CH20C(0)OR), 
4.10 (m, 2H, CH3CH20P), 3.80-3.55 (m, 12H, CH20R), 2.84 (s, 4H, CH2C(0)N), 
10 2.15 (m, 2H, CH2P), 1.31 (t, J = 6.0 Hz, 3H, CH3CH20P). 

[01 20] Compound 8. A solution of 7 (0.007 g, 0.016 mmol, 1 .0 equiv) in 

CH2C12 (0.22 mL, 0.075 M) at -78°C was treated with (diethylamino)sulfur trifluoride 
(DAST, 0.007 mL, 0.048 mmol, 3.0 equiv) and the reaction mixture was stirred for 10 
min . The reaction mixture was then partitioned between ethyl acetate (100 mL) and 

15 H20 (100 mL) and the organic layer was washed with saturated aqueous NaCl (100 

mL), dried (Na2S04), and concentrated under reduced pressure. Chromatography 
(Si02, Pasteur pipette, 100% ethyl acetate) afforded 8 (0.003 g, 0.007 g theoretical, 
42%) as a clear oil: 1H NMR (CDC13, 400 MHz) 5 4,45 (m, 2H, CH20C(0)OR), 4.27 
(m, 2H, CH3CH20P), 3.80-3.55 (m, 12H, CH20R), 2.84 (s, 4H, CH2C(0)N), 2.32- 

20 2.26 (m, 2H, CH2P), 1 .3 1 (t, J = 6.0 Hz, 3H, CH3CH20P). 

[0121] Compound 9. A solution of tetramethylrhodamine ethylenediamine 

(Molecular Probes, Eugene, OR) (0.005g, 0.010 mmol, 1.0 equiv) in DMF (0.5 mL, 
0.020 M) was added to compound 8 (neat, 0.007g, 0.016 mmol, 1.7 equiv) and the 
reaction mixture was stirred for 30 min at room temperature. The solvent was 

25 removed under vacuum and the products were resuspended in a 0.35 mL of a water- 

acetonitrile mixture (1:1 v./v.) containing 0.1% (v./v.) trifluoroacetic acid. An aliquot 
of this solution (0.30 mL) was injected on a preparative reverse phase HPLC column 
(Haisil 100 C8, Higgins Analytical, 20 mm x 150 mm), separated using a 0-100% 
acetonitrile gradient in 30 minutes at 10 mL per min. The retention time under these 

30 conditions was 19.95 min. The solvent was removed under vacuum using a rotary 

evaporator, and afforded 9 (0.0035 g, 0.0042 mmol, 42%) as a darkly colored oil. 

38 



WO 02/071066 PCT/US02/06234 

[0122] Example 2. Complex protein mixture analysis 

[0123] General Materials and Methods. Activity based probes were 

synthesized as described in Example 1 . Rat fatty acid amide hydrolase (FAAH) was 
expressed in E. coli and purified using standard protocols. Porcine trypsin and equine 
5 butyrylcholinesterase were purchased from Sigma corp. Urokinase plasminogen 

activator, tissue plasminogen activator, prostate specific antigen, and bovine trypsin 
were purchased from Calbiochem (San Diego, CA). 

[0124] Labeling of purified serine hydrolases. Each purified serine 

hydrolase was diluted to 0.01-0.05 mg/mL in reaction buffer (50 mM Tris pH 7.0, 100 
10 mM NaCl) containing 1 mg/mL heat denatured rat liver soluble proteome. The 

mixtures were labeled for 1 h at 20 °C with 2 uM FP-Peg-TAMRA. Unreacted probe 
was then removed by gel filtration on 10DG columns (Bio-Rad). 

[0125] Preparation of internal standard proteins. Bovine trypsin, porcine 

trypsin, and porcine elastase were diluted to 0.25 mg/mL in reaction buffer and labeled 
15 with 20 uM FP-Peg-BodipyFL for 30 min. Excess probe was removed by gel 

filtration on 10DG columns (Bio-Rad). The proteins were mixed together and l/30 th 
volume of this cocktail was used added to each sample as an internal standard. 

[0126] Preparation and digestion of complex protein mixtures. For studies 

involving purified serine hydrolases, the eight labeled proteins were mixed together 

20 and 3 uL of the cocktail was added to 27 uL of either rat liver soluble proteome (1.5 

mg/mL protein), rat liver membrane proteome (1 .5 mg/mL protein), rat brain soluble 
proteome (1.5 mg/mL protein), rat brain membrane proteome (1.5 mg/mL protein), 
MDA-MB-231 conditioned media (0.5 mg/mL protein), MDA-MB-435 conditioned 
media 0.5 mg/mL protein), whole yeast lysate (8 mg/mL), or human plasma (50 

25 mg/mL protein). FAAH inhibition experiments were conducted by pre-incubating rat 

brain or testis membrane fractions (1.5 mg/mL protein, in reaction buffer) in the 
presence or absence of 50 uM oleoyl trifluoromethylketone for 15 min followed by 
the addition of 5 uM FP-Peg-TAMRA and incubation at 20 °C for 1 h.. 



30 



[0127] Following labeling, the proteomes were denatured by adding 1 volume 

of 12M urea (prepared at 50 °C to allow for complete dissolution of the urea), and 
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1/100 volume of DTT from a fresh 1M stock. The mixture was then heated to 65 °C 
for 15 min. Iodoacetamide was added to 40 mM from a fresh 1M solution and the 
solution was incubated at 37 °C for 30 min to alkylate available cysteine residues. 

[0128] Prior to trypsin digest, the buffer was exchanged by gel filtration to 

remove excess reagents and unreacted probe, and reduce the urea concentration to a 
suitable concentration (2M) for trypsin activity. Gel filtration was performed using 
sephadex G-25 superfine resin (Amersham Biosciences) in custom 96-well filter plates 
from Innovative Microplate. The custom filterplates were designed to support gravity 
flow and allow collection volumes as small as 20 |iL. Gel filtration was performed 
with 20 nL or 50 jiL of sample using 350 jiL or 600 \jlL of hydrated beads respectively. 
Each well was rinsed prior to use with 5 bed volumes of digestion buffer (2M urea 10 
mM ammonium bicarbonate). Samples were applied to columns, followed by a 
defined amount of digestion buffer (150 jiL for 350 \xL columns, 200 jiL for 600 nL 
columns). Three fractions of 20 nL or 50 were then collected into 96-well plates. 
The highest concentration of protein was consistently observed to elute in the second 
fraction. 

[0129] Following gel filtration, the samples were treated with 150 ng trypsin 

(sequencing grade modified trypsin, Promega) per 10 jiL of sample for 1 h at 37 °C. 
Digests were stopped and prepared for CE by addition of l/20 th volume each of 200 
mM citrate, and 1 0% triton X- 1 00. 

[0130] Capillary Electrophoresis, Capillary electrophoresis was performed 

using a Beckman PACE MDQ CE system. Excitation was provided by a 1 00 m W, 
frequency doubled (532 nra) Nd:YAG laser (Coherent inc.). The laser was mounted 
on an optical breadboard and focused into one end of a 100 jiM fiber optic cable using 
a laser-fiber coupler (OZ optics). The other end of the fiber optic cable was interfaced 
with the Beckman CE using an SMA 905 connector. The laser power measured at the 
capillary launch point was 25 mW. Fluorescence was collected into two channels, one 
for the TAMRA signal and one for the BodipyFL signal. The TAMRA channel 
contained a 532 nm notch filter, followed by a 580 nm bandpass filter (Omega 
Optical). The BodipyFL channel contained a 520 nm short pass filter, followed by a 
510 nm bandpass filter. 
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[0131] Samples were separated in 100 jiM X 60 cm eCAP DNA capillaries 

(Beckman Coulter inc.). The running buffer contained 50 mM aspartic acid, 10 mM 
hepes, 0.2% triton X-100 (pH 3.1 without . adjustment). Prior to each run, capillaries 
were rinsed at 20 psi for 1.5 min. Samples were injected for 6 s at 0.1 psi (20 nL 
5 estimated injection volume) then separated by applying 25 kV (inlet=anode) for 45 

minutes. The measured current under these conditions was 23 |iA. 

[0132] The results of a typical analysis are shown in Figure 3, which displays a 

capillary electrophoresis profile for peptides generated from eight serine hydrolases 
digested in a complex protein mixture (liver cytosol). Two different labeled peptides 
10 were generated by proteolysis of each of two of these proteins (porcine and bovine 

trypsin), while single peptides were generated from the remaining six proteins. The 
reproducibility of these results across several CE procedures is shown in the table 
below: 



Protein 


0/ 

/o 


RMSD % 


RMSD % 




' 0.073006 


15.81026 


13.90155 




0.019949 


1 3.64885 


1 5.25825 




0.098537 


22.37438 


21.71691 




0.101285 


37.06117 


40.10104 


MA 


0.104278 


26.65682 


28.78821 




0.105724 


41.79855 


45.2839 


WAMfi 


0.129092 


20.59908 


20.97169 




0.331954 


27.00883 


23.02499 


BBswQod® tiEOTBfiflQ iM 


0.128533 


40.09026 


45.68084 




0.090917 


39.96854 


40.26177 



25 
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[01331 Example 2. ABP-labeled peptide identification and CE migration 

determination 

[0134] A complex protein mixture is labeled with an ABP containing TAMRA 

5 as the ligand. The sample is then processed, and digested as in example 1. For peptide 

identification from a proteome, typically 1-5 mg of total protein is used. Following 
digestion, 1/1 0 th volume of 10% SDS (w/v) is added to the sample and it is heated to 
65 °C for 5 min. The sample is then mixed with an equal volume of 2X binding buffer 
(2% Triton X-l 00, 1 % Tergitol NP40 type, 300 mM NaCl, 2 mM EDTA, 20 mM Tris 

10 pH 7.4), before adding 20-100 of beads containing anti-TAMRA antibodies. The 

mixture is rotated at ambient temperature for 1 h. The beads are pelleted by 
centrifiigation and the supernatant removed. The beads are then washed 3 times with 
lmL of IX binding buffer containing 1% SDS, followed by 3 washes with 50 mM Tris 
pH 8.0, and 2 washes with deionized water. Captured peptides are eluted from the 

15 beads with 2 bead volumes of 50% acetonitrile/H 2 0 + 0.5% trifluoroacetic acid. The 

eluted peptides are concentrated to approximately 1 0-20 jxL before proceeding. 

[0135] The sample of affinity enriched ABP labeled peptides is injected onto a 

300 |iM CI 8 reverse phase HPLC column at a flow rate of 5 \xL per minute. Peptides 
are separated over a 2h gradient from 5-50% acetonitrile + 0.1% formic acid in H 2 0 + 

20 0.1% formic acid. The eluant of the HPLC column is split, sending approximately 

0.5-1 nL per minute to an ESI-ion trap mass spectrometer, and the other 4-4.5 |iL per 
minute to a capillary fluorescence detector and a fraction collector in series (see Figure 
2). During the run, the mass spectrometer alternates between MS mode and MS/MS 
mode to obtain all data necessary to identify the proteins giving rise to the observed 

25 peptides. Fractions are collected at 0.5 min intervals into a V-bottom 96 well-plate 

pre-loaded with 10 \\L of 50 mM aspartic acid containing internal standard peptides. 

■v 

[0136] Following the run, MS data is searched using a program such as 

SEQUEST or MASCOT, with the appropriate mass of the ABP tag included in the 
search. The presence of the ABP tag on each identified peptide can be confirmed by 
30 the presence of a fluorescence signal at the appropriate time for each species. Once 

the species present in the sample are identified, their positions in the 96-well can be 
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determined and these fractions can be run on a CE instrument to determine their 
migration patterns. Additionally, since the CE uses minimal amounts of sample (20 
nL), excess material can be stored for future reference. 

[0137] The subject methods provide an efficient means for screening cells for 

5 their responses to change in status and environment. The methodology allows for 

substantial simplification of the sample to be analyzed. By using a combination of 
ABPs and digestion, the resulting fragment conjugates can be more readily analyzed 
for the proteins present in the proteome. The ABP provides substantial assurance that 
the conjugate will have a single probe at a single site. Upon digestion this will lead to 

10 a single fragment conjugate. The proteolysis will substantially reduce the likelihood of 

having a glycolytically modified amino acid, particularly having fragments that are 
differentially glycosylated. In this manner, the number of different fragments 
associated with the same protein is diminished. There is also less likelihood of other 
artifacts being introduced that increase the number of fragments resulting from a single 

15 protein. 

[0138] For status, one can determine the regulation of target proteins as cells 

differentiate, mature or change their characteristics. For environment, one can 
evaluate candidate drugs in real time, using an automatable system that allows for the 
direct comparison of a plurality of drugs. The target proteins can be quantitated. 

20 Since the target proteins are in their active state, a dynamic picture of the state of the 

cell is determined. This means that one is obtaining a true picture of how the cell 
responds to the change in environment and the pathways that are active in the presence 
of the candidate drug. The subject methods provide for a more accurate and complete 
understanding of the effect of a drug. By virtue of the methods, one can perform 

25 numerous assays with a single drug, varying the cells to obtain a more complete 

picture of the effect of the drug on a host. By determining a spectrum of effects, a 
better understanding of the activity of the drugs is obtained. 

[0139] In addition, the subject methods permit different levels of analysis as to 

the time required for the determinations. In those cases where quantitation can be 
30 achieved from the migration of the conjugate, rapid parallel determinations can be 

made. Where further analysis is required, the capabilities of mass spectrometry are 
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enlisted, where the presence of a fluorescent marker permits selection of only those 
fractions or bands that contain components of interest. 

[0140] Unless otherwise defined, all technical and scientific terms used herein 

have the same meaning as commonly understood by one of ordinary skill in the art to 
5 which this invention belongs. All publications, patent applications, patents, and other 

references mentioned herein are incorporated by reference in their entirety. In case of 
conflict, the present specification, including definitions, will control. In addition, the 
materials, methods, and examples are illustrative only and not intended to be limiting. 

[0141] While the invention has been described in detail with reference to 

10 certain preferred embodiments thereof, it will be understood that modifications and 

variations are within the spirit and scope of that which is described and claimed. 
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WHAT IS CLAIMED IS: 

1 . A method for determining the presence in a complex protein mixture of 
5 at least one target protein in its active state employing a probe composition comprising 

at least one labeled probe including a ligand and detectable in a liquid chromatography 
or electrophoretic separation, wherein each of said probes in said composition has 
specificity for one or a related family of proteins, said method comprising: 

conjugating a first sample complex protein mixture with a first probe 
10 composition, whereby conjugates of said probes and said target proteins are produced; 

sequestering said conjugates with a solid support to which is bound receptor for 
said ligand; 

proteolytically digesting said conjugates either prior or subsequent to said 

sequestering to produce fragment conjugates; 
15 releasing said fragment conjugates; and 

separating said fragment conjugates by at least one of liquid chromatography, 

electrophoretic separation or mass spectrometry based on the different migration rates 

of said fragment conjugates, using said label to detect said fragment conjugates, or for 

mass spectrometry based on ion peaks, 
20 whereby the elution or migration rate of said fragment conjugates indicates the 

presence of said target protein and/or for mass spectrometry the ion peaks indicate the 

presence of said target protein. 

2. A method according to Claim 1 , wherein said separating is with liquid 
25 chromatography to produce eluate fractions and comprising the additional step of 

analyzing said fractions using mass spectrometry. 

3. A method according to Claim 2, wherein said mass spectrometry 
comprises matrix-assisted laser desorption ionization ("MALDI") or electrospray 

30 ionization. 

4. A method according to Claim 2, wherein said mass spectrometry is 

MS". 
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5. A method according to Claim 4, wherein said receptor is antibody 
directed against a fluorescent moiety. 

6. A method for determining the presence in a complex protein mixture of 

« 

5 at least one target protein in its active state employing a probe composition comprising 

at least one labeled probe including a ligand and detectable in a liquid chromatography 
or electrophoretic separation, wherein each of said probes in said composition has 
specificity for one or a related family of proteins, said method comprising: 
conjugating a first sample complex protein mixture with a first probe 

i 

10 composition and a second control complex protein mixture with a second probe 

composition, where said first and second probe compositions differ in having 
independently detectable labels to produce conjugates of said probes and said target 
proteins in said complex protein mixtures; 

separating said conjugates in said first and second complex protein mixtures by 
1 5 binding said conjugates to a solid support by means of a receptor for said ligand to 

provide bound conjugates; 

washing said bound conjugates free of non-specifically bound components of 
said complex protein mixtures, followed by releasing said conjugates; 
proteolytically digesting proteins at a time prior to releasing; 
20 enriching released conjugate components by separating conjugates by at least 

one of liquid chromatography and electrophoresis with detection of conjugates by 
means of said detectable probe; 

characterizing said conjugates by their migration rates in said enriching step. 

25 7. A method according to Claim 6, including the additional step of further 

analyzing said conjugates by mass spectrometry for identification and quantification. 

8. A method according to Claim 6, including the additional steps of: 
eluting from said liquid chromatography individual fractions in wells; 
30 screening said wells for the presence of said detectable probe; and 

transferring fractions containing said detectable probe for analyzing by said 
mass spectrometry. 
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9. A method according to Claim 6, wherein said label is a fluorescent label 
and said separating is with antibodies to said fluorescent label. 

10. A method according to Claim 9, wherein said fluorescent labels in said 
first and second probe compositions are matched to similarly affect the migration of 
said conjugates. 

11. A method according to Claim 6, wherein said proteolytically digesting 
is with two different proteases. 

1 2. A method according to Claim 6, wherein said second complex protein 
mixture differs from said first complex protein mixture in being denatured by a non- 
covalent denaturing agent. 

13. A method according to Claim 6, wherein the amount of said conjugate 
is determined by the relative amounts of said detectable probes in said conjugates of 
the same target protein from said first and second complex protein mixtures. 

14. A method for determining the presence in a complex protein mixture of 
at least one target protein in its active state employing a probe composition comprising 
at least one isotopically labeled probe comprising a label detectable in a liquid 
chromatography or electrophoretic separation, wherein each of said probes in said 
composition has specificity for one or a related family of proteins, said method 
comprising: 

conjugating a first sample complex protein mixture with a first probe 
composition and a second control complex protein mixture with a second probe 
composition, where said first and second probe compositions differ in their isotopic 
labeling to produce conjugates of said probes and said target proteins; 

enriching conjugate components by separating conjugates by at least one of 
liquid chromatography and electrophoresis with detection of conjugates by means of 
said detectable label to provide separately enriched fractions; 

analyzing said separately enriched fractions by mass spectrometry and 
obtaining a comparison of the amount of conjugates from each of said first and second 
complex protein mixtures; 
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whereby the mass spectrometry provides identification and quantitation of said 
target proteins. 

15. A method according to Claim 14, wherein said mass spectrometry is 
5 MALDI and including the steps of: 

introducing fractions into a MALDI plate; 

detecting enriched fractions containing conjugates by means of said detectable 

label. 

10 16. A method according to Claim 9, wherein said mass spectrometry is ESI. 

17. A method for determining the presence in a complex protein mixture of 
at least one target protein in its active state employing a probe composition comprising 
at least one labeled probe including a ligand and detectable in a liquid chromatography 
15 or electrophoretic separation, wherein each of said probes in said composition has 

specificity for one or a related family of proteins, said method comprising: 

conjugating a first sample complex protein mixture with a first probe 
composition, whereby conjugates of said probes and said target proteins are produced; 

combining said conjugates with a at least one protease bound to a solid surface 
20 and incubating for sufficient time for fragment conjugates to be formed; 

sequestering said conjugates with a solid support to which is bound receptor for 
said ligand; 

proteolytically digesting said conjugates either prior or subsequent to said 
sequestering to produce fragment conjugates; 
25 releasing said fragment conjugates; and 

separating said fragment conjugates by at least one of liquid chromatography or 
electrophoretic separation based on the different migration rates of said fragment 
conjugates, using said label to detect said fragment conjugates, 

whereby the elution or migration rate of said fragment conjugates indicates the 
30 presence of said target protein. 

,18. A method according to Claim 17, wherein fragment conjugates from 
fractions of said liquid chromatography are analyzed fry mass spectrometry. 
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19. A method according to Claim 17, wherein fragment conjugates from 
said capillary electrophoresis are analyzed by mass spectrometry. 

20. A kit comprising a plurality of affinity based labeled probes comprising 
5 a ligand, at least one protease, and a protein standard. 

21 . A method for determining the presence, amount, or activity of one or 
more active target proteins in a complex protein mixture, the method comprising: 

(a) contacting said complex protein mixture with an activity based probe that 
1 0 specifically binds to one or more active target proteins; 

(b) proteolyzing said active target protein(s) to produce a product mixture; 

(c) separating said product mixture into one or more components, one or more 
of which comprise peptides bound to said probe; and 

(d) generating a signal from said peptides bound to said probe, wherein said 
15 signal is correlated to the presence, amount, or activity of said one or more active 

target proteins in said complex protein mixture. 



22. A method according to claim 21, wherein said separating step (c) 
comprises sequestering one or more peptides bound to said probe using a receptor that 

20 specifically binds to said probe. 

23. A method according to claim 22, wherein said probe comprises a 
fluorescent moiety, and said receptor is an antibody or fragment thereof that binds to 
said fluorescent moiety. 

25 

24. A method according to claim 21 , wherein said probe comprises a 
fluorescent moiety, and said signal is a fluorescent signal generated from said probe. 



25. A method according to claim 21 , wherein said signal is a mass 
30 spectrum. 
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26. A method according to claim 21 , wherein, prior to said proteolyzing 
step (b), one or more components of said complex protein mixture not bound to said 
probe are removed from said complex protein mixture. 

5 27. A method according to claim 21 , wherein said probe comprises a label 

selected from the group consisting of a fluorescent moiety and an isotopic label. 

28. A method according to claim 21, wherein said separating step (c) 
comprises one or more separation methods selected from the group consisting of 
10 affinity separation, gel electrophoresis, capillary electrophoresis, liquid 

chromatography, HPLC, electrospray ionization and MALDI. 



29. A method according to claim 21, wherein prior to said proteolyzing 
step (b), said one or more active target proteins bound to said probe are bound to a 

15 solid support. 

30. A method according to claim 21, wherein said method further 
comprises adding one or more standard proteins to said complex protein mixture prior 
to said proteolysis step (b). 

20 

31. A method according to claim 30, wherein said standard protein(s) are 
labeled with an activity based probe prior to addition to said complex protein mixture. 

32. A method according to claim 3 1 , wherein said standard protein(s) are 
25 labeled with an activity based probe comprising a fluorescent moiety that is 

distinguishable from said activity based probe contacted with complex protein mixture. 



33. A method for comparing the presence, amount, or activity of one or 
more active target proteins in two complex protein mixtures, the method comprising: 
30 (a) contacting a first complex protein mixture with a first activity based probe, 

and contacting a second complex protein mixture with a second activity based probe, 
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wherein each of said first and second probes specifically bind to one or more active 
target proteins in each complex protein mixture; 

(b) combining said first and said second complex protein mixtures to form a 
combined complex protein mixture; 
5 (c) proteolyzing said active target protein(s) in said combined complex protein 

mixture to produce a product mixture; 

(d) separating said product mixture into one or more components, one or more 
of which components comprise peptides bound to said first probe and peptides bound 
to said second probe; and 
10 (e) comparing a first signal generated from said peptides bound to said first 

probe, and a second signal generated from said peptides bound to said second probe, 
wherein said comparison is correlated to the relative presence, amount, or activity of 
said one or more active target proteins in said first and second complex protein 
mixtures. 

15 

34. A method according to claim 33, wherein said separating step (d) 
comprises sequestering one or more peptides bound to said first probe using a first 
receptor that specifically binds to said first probe and sequestering one or more 
peptides bound to said second probe using a second receptor that specifically binds to 

20 said second probe. 

35. A method according to claim 34, wherein said first and second probes 
comprise a fluorescent moiety, and said receptors are antibodies or fragments thereof 
that bind to said fluorescent moiety. 



25 



36. A method according to claim 34, wherein said first and second probes 
each comprise a fluorescent moiety, said first and second signals are fluorescent 
signals generated from said first and second probes, and said first and second signals 
are distinguishable from one another. 



30 
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37. A method according to claim 33, wherein said first and second probes 
are isotopically different, said first and second signals are mass spectra, and said first 

« 

and second signals are distinguishable from one another. 

5 38. A method according to claim 33, wherein, prior to said proteolyzing 

step (b), one or more components of said combined complex protein mixture not 
bound to said first or second probes are removed from said complex protein mixture. 

39. A method according to claim 33, wherein said first and second probes 
10 each comprise a label selected from the group consisting of a fluorescent label and an 

isotopic label. 

40. A method according to claim 33, wherein said separating step (d) 
comprises one or more separation methods selected from the group consisting of 

15 affinity separation, gel electrophoresis, capillary electrophoresis, liquid 

chromatography, HPLC, electrospray ionization, and MALDI. 

41 . A method according to claim 33, wherein prior to said proteolyzing step 
(b), said one or more active target proteins bound to said first and second probes are 

20 bound to a solid support. 

42. A method according to claim 33, wherein said method further 
comprises adding one or more standard proteins to said first, second, or combined 
complex protein mixture prior to said proteolysis step (b). 

25 

43. A method according to claim 42, wherein said standard protein(s) are 
labeled with an activity based probe prior to addition to said first, second, or combined 
complex protein mixture. 

30 44. A method according to claim 43, wherein said standard protein(s) are 

labeled with an activity based probe comprising a fluorescent moiety that is 
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distinguishable from said activity based probe contacted with complex protein 
mixtures. 

45. A method of correlating a separation profile to a peptide having a 

5 known sequence, wherein said peptide is bound to an activity based probe, the method 

comprising: 

generating said separation profile for said peptide by performing one or more 
separation methods and generating a signal from said peptide bound to said probe, 
wherein said separation profile is characteristic of said peptide having said known 
10 sequence. 

46. A method according to claim 45, wherein said method further 
comprises obtaining the known sequence of said peptide by mass spectrometry. 

1 5 47. A method according to claim 46, wherein said method further 

comprises: 

(a) contacting a protein sample comprising an amino acid sequence encoding 
said peptide with an activity based probe that specifically binds to said amino acid 
sequence; 

20 (b) proteolyzing said protein sample to produce a product mixture; 

(c) separating peptides bound to said probe from said product mixture; 

(d) generating a mass spectrum from said peptides bound to said probe, 
wherein said mass spectrum provides the known sequence of said peptide; and 

(e) generating said separation profile by capillary electrophoresis of said 

25 peptides bound to said probe, wherein said separation profile is one or more migration 

time(s) of said peptides bound to said probe. 

48. A method according to any one of claims 1, 6, 14, 17, 21, or 33, 
wherein said complex protein mixture is a proteome. 

30 
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